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ABSTRACT 

This study attempted to evaluate the effects on pupil 
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consistent set of findings related to the ineffectiveness of the 
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language-cognition Spanish groups receive scores higher than the 
other experimental or control groups. The scores of the children in 
all the treatments, experimental and control, at both grade levels 
were very low. (HR) 
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PREFACE 



This study by Mary Ellen Swanson SaUnas was 
originally intended as primarily an analysis of the 
San Antonio Language T^esearch Project data for its 
fourth year (1967-63). However, due to the fact 
that data analysis was delayed until the 1971-72 
year, the scope of the report was escpanded to include 
a review and evaluation of the entire project on a 
longitudinal basis. Mrs. Salinas is to be comnended 
fci her objective, straightforward, frank and honest, 
yet sympathetic, analyses of the trials, tribulations, 
successes (admittedly few) and failures of a relatively 
early under-financed attempt at longitudinal research 
in educational innovations for children of poverty. 

In the words of Theodore Sizer ( Phi Delta 
Kagean. June. 1972. p. 635). former dean of the Harvard 
Graduate School of Education and headmaster to be at 
Phillips Acadeoay. Andover. 

"TO be honest with ourselves in education is 
neady medxcxne. requiring not only courage 
but self-confidence, m raust be sure that 
we can. when faced with fresh evidence or 
new conditions, shift our manner of work- 
even our belief s— fundamentally. 



Thomas 0. Horn 



The University of Texas at Austin 
August. 1972 
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CHAPTER I 



INTRODUCTION 

A sample survey made in November, 1969, by the U.S. 

Bureau of the Census found that 9,230,000 persons— about 

five percent of the approximately 200,000,000 people in 

the united States — reported that they were of Spanish 

origin,! e.g., Mexican, Cviban, Paerto Rican or from other 

Spanish- speaking countries. Convparisons regarding income 

and enployment included the findings that: 

Spanish origin family income was on the average 
$5,600, about 70 percent of that of other fami- 
lies. 

Spanish origin persons had an \inenployment rate 
of 6.0 percent, abou":. 1.7 times the unemployment 
rate of all other persons in the labor force. 

Spanish origin workers were less likely to hold 
white-collar jobs than the remainina eitployed 
populations of the United states. 2 

Sixty-one percent of the people of Spanish origin 
lived in the Southwest, i.e., Texas, California, Colorado, 
New Mexico and Arizona.'^ Five million, seventy-three 



■^U.S. Bureau of the Census, Current Population 
Reports, Series P-20, No. 213, "Persons of Spanish Origin 
in the United States: November 1969," U.S. Government 
Printing Office, Washington, D.C., 1971, pp. 1-2. (Here- 
after referred to as "Persons of Spanish Origin.") 

2 Ibid . , p. 2. 

^Ibid. 
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thousand or fifty-five percent of the persons of Spanish 
origin identified themselves as being of Mexican origin. 
Eighty-six percent of these people lived in the Southwest.-^ 
There was no breakdown by state, but another study estimated 
that eighty-five percent of the Mexican- Americans who live 
in the Southwest, live in California and Texas. 

Further evidence of population concentration is the 
fact that thirty-three and one-half, percent of the almost 
three and one-half million persons with Spanish surnames 
lived in fourteen cities in the Southwest, as of 1960.3 
Data on the Spanish population in the 1950 census and 1960 
census were based on the less inclusive measure of "Spanish 
surname." Many persons of Spanish origin do not have Spanish 
surnames. Within Texas, persons with Spanish surnames are 
concentrated in the southern part of the state so that they 
are the majority ethnic group in cities such as Brownsville 
and Laredo.^ 



^Ibid. , p. 4. 



2 

Walter Fogel, "Mexican-Americans in Southwest 
Labor Markets," Mexican-American Study Project Advance 
Report J.0, University of California, Los Angeles, 1967, 

^bert W. i^acMillan, "A study of the Effect of 
Socioeconomic Factors on the School Achievement of Spanish- 
f?™S? ^?}^°^ Beginners," Unpublished Doctoral Disserta- 
tion. The University of Texas. Austin. 1966. pp. 35 and 38. 

_ ^ ^rschel T. Manuel. Spanish Speaking Children of 
p. 29?^^ The University of Texas Press/ Austin. 1965. 



Mexican- Americans have been characterized histor- 
ically as a iinique group because of their high degree of 
homogeneity. Being Mexican-American has been almost 
synonymous with being poor. Saunders reported in 1949 
that: 

Everywhere . . . there is poverty. Not all 
Spanish- speaking are poor, but in general more 
of them are poor than is true for any other 
group. ... If one were to attempt to charac- 
terize the conditions of the Spanish- speaking 
Texans, he would be forced to say that, in 
general, and for nearly any index of socio- 
economic status that might be devised, the 
Spanish-speaking people are found to occupy a 
less desirable position than the Anglos or the 
population as a whole. 

Conclusions by Bogue about Mexican- Americans in the 
1950 's revealed no substantial change from Saunders' obser- 
vations of the 1940 's. 



As a group, immigrants from Mexico have a very 
low educational attainment. Although the 
educational attainment of the second generation 
IS somewhat higher than most of the first genera- 
tion, it is still far below that of the native 
stock or of any other immigrant group. Mexicans 
are einployed largely as unskilled laborers, and 
there is not a great deal of difference between 
first and second generations in this respect. 
The Mexican is the most poorly educated member 
of the nation's population, with an educational 
level lower than that of the Puerto Rican. This 
is the only ethnic group for which a coiiparison 



^Lyle Saunders, Address given to the National 
Convention, League of united Latin American Citizens, San 
Antonio, Texas, June 11, 1949. pp. 7-8 (Mimeographed), 
quoted by Carey McWilliams, Brothers Under the Skin . 
Uttle, Brown and Company, Boston, 1951, pp. 131-2. 



of the characteristics of the first and second 
generation fails to show a substantial inter- 
generational rise in socio-economic status.-^ 

Data from the 1960 's show continued lack of prog- 
ress. In November, 1969, 6.2 percent of the persons of 
Mexican origin who were 16 years of age or older and were 
in the labor force were unemployed. At that time, only 
3.6 percent of all people in the labor force 16 years old 
and over were unemployed.^ 

In a classification of occupations that included 
fam workers, service workers, blue-collar workers and 
white-collar workers, only 18.5 percent of Mexican males 
were white-collar workers in November, 1969. This com- 
pared to figures of 25.3 percent for all males of Spanish 
origin and 41.4 percent for men of all other origins. 
Figures for female workers were as follows: 37.7 percent 
of those of Mexican origin, 40.9 percent of those of 
Spanish origin, and 60.7 percent of all other origins were 
white-collar workers.^ 

The median income for households with a head of 
Spanish origin in November, 1969, was $5 , 641— conpared to 



«^ ^ tP°^^^^ Bogue, The Population of the United 
States, The Free Press, Glencoe, 111., 1959, p. 372. — 

^"Persons of Spanish Origin," p. 34. 



$8,011 for all other origins and $5,488 for heads of Mexi- 
can origin.^ 

Clearly, the socioeconomic condition of the Mexican- 
American has not improved spectacularly in the last 30 years. 
Additional data can be found in Appendix A which makes the 
situation of the Mexican- Americans in the Southwest, Texas 
and San Antonio even more graphic. 

Judging from what is known about the difficulties 
that economically disadvantaged children have in school, 
one would eacpect Mexican-Americans to have attained only 
minimal levels of educational achievement. ^ Data confirm 
this expectation, of persons aged 20-49 in Texas, one-third 
of the Anglos have not graduated from high school, while over 
three-fourths of the Mexican- Americans have not. In seventh 
through twelfth grades the dropout rate of Mexican-Americans 

is almost twice that of Anglos, and Mexican- Americans drop 

3 

out sooner. 

In addition to the problem of poverty that the 
typical Mexican-American child brings to school, he also 



^ Ibid . . p. 34. 

2paul A. Witty (ed. ), "The Educationally Retarded 
and Disadvantaged, " Sixtv-Sixth Yearbook for the National 
Society for the Stu dy of Education. Part I . University n-F 
Chicago Press, Chicago, 1967. 

^Governor's Committee on Public School Education, 
The Challenge and the Chance. Governor's Committee on 
Public School Education, Austin, Texas, 1963 (non-paged). 
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is likely to be deficient in English language skills. A 
1957 report of the Texas Education Agency indicated that 
at that time, the average child in Texas with a Spanish 
surname was spending three years in first grade .and was 
dropping out of school before reaching the fifth grade. 
This educational deficiency was correlated by Andersson 
with inability in English.^ 

Similarly, a survey involving 123 Texas school 
districts in 1962 found that: 

The failure rate of Spanish- speaking children 
Without any preschool instruction in the first 
grade was 32 percent. A major reason for this 
failure was reported to be their inability to 
comprehend and use English. 2 

In 1964 in a predominantly Mexican-American area 
of San Antonio, Texas, 98 percent of the children entering 
first grade spoke no English.^ 



Theodore Andersson, "Bilingual Education: The 
American Eacperience, " Paper presented at the Ontario 
Instiitute for Studies in Education Conference on Bilingual 
Education. Toronto. March 13, 1971. p. 11 (Mimeographed), 
citing Texas Education Agency, Division of Research, 
"Reports of Pupils in Texas Public Schools Having Spanish 
Surnames, 1955-56," Texas Education Agency, Austin, 1957. 
2 

M*-,^^ ^^^^'f*® ?• ^'owler, "An Evaluation of the Brengelaan- 
^^Sfm^i?^*^'' Capacity Index as a Predictor of Reading 
i^Jltnf^S SPf-^i^'^-Speaking First Grade Children," Un- 

A^tin Dissertation, The University of Texas. 

«n ii i' ^^^Ing Texas Education Agency. A Report 

°" fi Instructional Prewar ^ f o r Sn-EnrrlTl^r-^ v- 

j.nq Chiidre ji, Texas Education Ag ency, Austin, 1962. 

Oral-Aurf r??S?^ J^''^^ ^^^^^^^ Intensive 

oral-Aural English Language Instruction, OraJ.-Aural Spanish 



Despite the recent proliferation of so-called 
bilingual education programs, the "problem" of children 
coining to school without well-developed English language 
skills is not likely to disappear in the United States in 
the near future. 

As mentioned earlier, according to a sanple survey 
made in November, 1969, there were b», 230, 000 persons in 
the united states who identified themselves as being of 
Spanish origin, of these, 6,700,000 reported Spanish as 
their mother tongue. Spanish was the current home 
language of 4,622,000 persons.^ Of all persons of Mexi- 
can origin, 47.3 percent (or 2,401,000 persons) reported 
that Spanish is the language usually spoken in the home.^ 
Given the socio-economic history and present con- 
dition of many Mexican-Americans, combined with the census 
data regarding language currently spoken in the home, it 
is safe to assume that for some time to come many Mexican- 
American children in the Southwest will be doubly penalized 
in school—because of their socio-economic status and 
because of their English linguistic abilities. 



Language Instruction and Nbn-Oral-Aural Instruction on 
Reading Readiness in Grade One , " The University of Texas . 
Austin, 1966, p. 14. 

^"Persons of Spanish Origin," p. 1 
^ Ibid . . p. 14. 
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lyiPORTANCE oy THE STUDY 
The west side of San Antonio, Texas, is densely 
populated by disadvantaged Mexican- Americans. In an 
effort to precipitate change in the schools and to develop 
and demonstrate specific pedagogical techniques, the San 
Antonio Language Research Project was inaugurated in the 
fall of 1964 in ten west-side San Antonio schools. The 
Project was directed by Thomas D. Horn and had the coopera- 
tion of the San Antonio .Independent School District. 

Since the assxartptions were that oral and listening 
English language skill levels of Mexican-American children 
were insufficient for successful academic achievement, the 
educational techniques used in the ejqjerimental treatments 
were various methods of oral language instruction. One 
treatment consisted of intensive oral-aural language 
instruction in English; another, intensive oral-aural 
language instruction in Spanish; and a third, unstructured 
language activities in English. All three treatments used 
science materials as content. 

The original stated purpose and research focus of 
the Project in Year One (1964-65) was to "compare the 
effectiveness of three methods of developing reading 
readiness in Spanish-speaking boys and girls in the first 
grade. While the efforts to iirprove oral language 



Horn, 0£. cit. , pp. 3-4. 



continued into successive years, the goals changed from 
reading readiness to reading proficiency and achievement, 
to academic achievement in general. The San Antonio 
Language Research Project was in effect e3q)loring two 
major questions: 

(1) Are structured audio-lingual techniques more 
effective than tore informal language activities for 
inproving oral English and Spanish? 

(2) Will inprovement in oral language be re- 
flected in iinproved achievement in reading and other 
academic subjects? 

Several doctoral dissertations, articles, and 
research reports concerning the Project have been com- 
pleted. Four of these are viewed as reporting the results 
of the eacperimental treatments at the end of successive 
school years for children entering first grade in Years 
One (1964-65) and Two (1965-66). ^ Taylor, Year Five 
(1968-69), attempted to answer the first major question 
mentioned above while the authors of the other three 
yearly reports (Year One— 1964-65, Year Two~1965-66, and 



^v^^ r.- — ^chard D. Arnold, "1965-66 (Year 

Tv#oJ Findings, San Antonio Language Research Project, Thomas 

« ^i^'^J?^' university of Texas, Austinri962r 

Lester N. Knight, "A Coitparison of the Effectiveness of 
Jn^f?''i«^ ?''5^;^^f^" English Instruction, Intensive Oral- 
?i Spanish Instruction, and Nbn-Oral-Aural Instruction 
SJ^^S®/. Achievement of Spanish-Speaking Second and 

Third-Grade Pupils," Unpublished DoctorSl Dissertation, 



Year Three— 1966-67 ) assumed that language had inproved 
and tried to provide answers to the second question. 

If one is willing to assume that the tests used 
to measure reading and reading readiness were equally 
valid for all treatment groups, then the following con- 
clusions can be made regarding the results during the 
first three years of the Project: after Year One (1964-65) 
there were; no significant differences between the treat- 
ment groups with regard to reading readiness. The findings 
in Years Two (1965-66) and Three (1966-67) were less clear 
and soiTtewhat contradictory, in general, however, the 
children who entered first grade in Year One (1964-65) 
were superior in reading achievement if they were in the 
treatment receiving unstructured (non-oral-aural) language 
activities. 1 The children who entered first grade in 
Year Two (1965-66) were superior if they received intensive 
oral-aural language instruction in English. ^ This could 
be due to benefits accruing from teachers' previous 
experience with the treatment. 



Taylor, "A Coaparative Study of the Effects of Oral-Auial 
Language Training on Gains in English LangSJo f" Fourth 
SSfnJi • ^ fS?^ Disadvantaged Me^can-SST^ CMld?e^? 
^^^^^19°°^"^°^*' Dissertation, The University of ?2ias, 

^Arnold, OE. cit . , p. 69; Arnold. SR- cit . , p. 215. 
^Arnold, 2E. , p. 70j Kni^t, ag- Sit-i p. 215. 
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In Year Four (1967-68) the info.itial language activ- 
ities of the non-oral-aural treatment were dropped. Then 
two main treatment groups were left at each grade: 
students who had received four or three years of intensive 
oral language instruction in English or in cpenish. Due 
to lack of adequate instruments to measure growth in oral 
language, no empirical evidence on oral language develop- 
ment was collected. There was no alternative but to 
assume that structured audio-lingual techniques were more 
effective than typical language activities for inproving 
oral language. The task became one of trying to answer 
the second major question: will improvement in oral 
language be reflected in iir$>roved achievement in reading 
and other academic subjects? 

By the end of Year Pour (1967-68) three additional 
questions became of special interest. First, would the patterns 
of superiority of the non-oral- aural treatment for the children 
who entered first grade in Year One (1964-65) and of the inten- 
sive English treatment for the children who entered first 
grade in Year Two (1965-66) continue into Year Pour (1967-68)? 
Second, would analysis of test scores after three or four 
years reveal the superiority of one treatment more clearly 
and dramatically than in previous years? Transfer of learning 
from oral language to reading may take place only after con- 
siderable training. Third, would the oral language training 



have any effect in acadenu.c areas? Up to this point, no 
one had investigated the effectiveness of the experimental 
treatments on school sxibjects other than reading. 

STATEMENT OF THE PRhRT.KM 

The problem in this study was to evaluate the 
effectiveness of experimental oral language programs at 
the end of three and four years of instruction. This was 
done by conparing the levels of intellectual skills and 
abilities of pupils in experimental third and fourth grade 
classes who had received intensive oral language instruc- 
tion in English or Spanish and of pupils in control classes 
in third and fourth grade who had received no intensive 
oral language instruction. 

INSTRUMENTATION 
The tests used to measure the level of intellectual 
skills and abilities were the Iowa Tests of Basic Skills , 
administered in the spring of Year Four (1967-68).^ The 
Iowa Tests of Bas ic Skills measure skills in five major 
areas: vocabulary, reading, language, work-study and 
arithmetic. Four subtests measure skills in the language 
area: spelling, capitalization, punctuation and usage. 
The work-study area is divided into three sections: map 



■^E.F. Lindquist and A.N. Hieronymus, Houghton 
Mifflin Co., Boston, 1964. 
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reading, reading graphs and tables, and knowledge and use 
of reference materials, in arithmetic the two subtests 
are: arithmetic concepts and arithmetic problem solving. 
Scores on The Lorae-Thorndike Intelligence Tests , adminis- 
tered in the fall of third grade, were used to estimate 
differences in learning capacity. ^ 

SAiMPLE DESCRIPTION 
The pupils tested in this study were in the Project 
schools during their entire school careers. It was estab- 
lished during Year One (1964-65) that, in almost every 
instance, these children were members of families of low 
socio-economic status and low educational attainment. ^ 
Their control of English at the beginning of first grade 
was informally assessed by several observers as being 
minimal or non-existent.^ 

During Year Four (1967-68) approximately 550 pupils 
in fourth grade and 675 pupils in third grade were members 
of experimental classes, unfortunately, not all of these 
pupils had always been in the same type of experimental 
treatment, or in an^r experimental treatment. Moreover, a 
few of the children had failed one or more grades. When 



w^^-,. "^I^iiig Lorge and Robert L. Thorndike, Houghton 
Mifflin Co. , Boston, 1957. 

2 

"MacMillan, o^. c^t . , p. 134. 
%orn, o£. sit., p. 14. 



14 

data for such children were eliminated, few pupils were 
left. Inconplete data elirdnai-ed still more pupils. There- 
fore, it was decided to include in the study data for all 
children identified as having received the same treatment 
for their three or four years in school. Random sampling 
from the e:>q3erimental classes was neither necessary nor 
advisable . 

The figures below show the nxamber of children who 
were included in this study from the intensive English and 
intensive Spanish classes: 

Fourth Grade Third Grade 

LCE 29 109 

LCS 60 40 

Henceforward, the intensive English treatment will be re- 
ferred to as the LCE treatment and the intensive Spanish 
treatment will be referred to as the LCS treatment. 

At the third grade level, one additional group was 
studied. These pupils had ijeen in non-oral-aural sections 
in first and second grades and were in LCE classes in 
third grade. The number of these pupils is shown below. 
Hereafter, these pupils will be referred to as members of 
the NOA-LCE group. 

Third Grade 
NOA-LCE 59 
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The children who served as "controls" in the data 
analysis were pupils in the same schools as the escperi- 
mental classes, who had never been in experimental classes, 
and for whom data were available. The table below reveals 
the number of children in the control group. 

Fourth Grade Third Grade 

Control 21 68 

HypOTHESES ~ 
The hypotheses for the fourth grade stated in null 
form follow: 

1. After four continuous years the LCE, LCS and 
Control groups will not differ significantly on the fol- 
lowing scores from the Iowa Tests of Basic Skills ; 

1 . 1 Vocabul ary . 

1.2 Reading Coitprehension. 

1.3 Language Skills— Spelling. 

1.4 Language Skills— Capitalization. 

1.5 Language Skills— Punctuation 

1.6 Language Skills — Usage. 

1.7 Language Skills — Total. 

1.8 Work- Study Skills — Map Reading. 

1.9 Wbrk-Study Skills— Reading Graphs and Tables. 

1.10 Work-Study Skills— Knowledge and Use of 
Reference Materials. 

1.11 Wbrk-Study Skills — Total. 
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1.12 Arithmetic Skills — Arithmetic Concepts. 

1.13 Arithmetic S3d.lls — Arithmetic Problem Solving. 

1.14 Arithmetic Skills — Total. 

1 . 15 Composite . 

The hypotheses for third grade stated in null form 

follow: 

2. After three continuc^us years the LCE, LCS, and 
NOA-LCE and Control groups will not differ significantly 
on the following scores from the Iowa Tests of Basic Skj,lls : 

2 . 1 Vocabulary . _ 

2 . 2 Reading Comprehension . 

2.3 Language Skills — Spelling. 

2.4 Language Skills — Capitalization. 

2.5 Language Skills — Punctuation. 

2.6 Language Skills— Usage. 

2.7 Language Skills — Total. 

2.8 Work-Study Skills — Map Reading. 

2.9 Work-Study Skills — Reading Graphs and Tables. 

2.10 Work-Study Skills — Knowledge and Use of 
Reference Materials. 

2.11 Work-Study Skills—Total. 

2.12 Arithmetic Slcills — Arithmetic Concepts. 

2.13 Arithmetic Skills — Arithmetic Problem Solving. 

2.14 Arithmetic Skills — Tol.. I. 

2.15 Composite. 



DESIGN AND METHOD OF ANALYSIS 
Analyses were applied to each grade separately. 
First, a series of descriptive statistics was produced. 
The mean IQ and mean score on each test of the Iowa Tests 
of Basic Skills were computed for each experimental and 
control group along with the appropriate standard devia- 
tions. The statistical iinit was the child, not the class, 
since there were no intact classes left. 

Second, analysis of variance was used to compare 
the mean IQ and Iowa Tests of Basic Skills scores of the 
various groups at each grade level. Third, analysis of 
covariance was used to coirpare the Iowa Tests of Basic 
Skills scores for the groups using the IQ scores as the 
covariable. 

ORGANIZATION OF REST OF THE STUDY 
Chapter II contains a history of the San Antonio 
Language Research Project including a description of the 
instructional methods, rationales for those methods, and 
an explanation of administrative controls ar J. sources of 
fxinds. Also included is a summary and critique of research 
pertaining to the Language Research Project. Chapter III 
relates in detail the procedures involved in the conduct 
of this study along with the findings. The summary, 
l:.mitations , conclusions, and recommendations may be found 
iji Chapter iv. 



CHAPTER II 
REVIEW OP PROJECT LITERATURE 



HISTORICAL PERSPECTIVE 

The San Antonio Language Research Project was 
inaugurated as Cooperative Research Project No. 2648 in 
the fall of 1964 with the cooperation of the San Antonio 
Independent School District, under the direction of 
Thomas D. Horn, chairman of the Department of Curriculum 
and Instruction, College of Education, The University of 
Texas at Austin. The Project was one of twenty-seven 
studies sponsored by the U.S. office of Education in an 
effort to determine the best methods of teaching beginning 
reading. 

The primary purpose of the Project in its first 
year was to "compare the effectiveness of three methods of 
developing reading readiness in Spanish- speaking boys and 
girls in the first grade. "1 The three methods were various 
types of oral language instruction that will be described 
later . 



j'^°™^f ^' Horn, "A Study of the Effects of Inten- 
rin-°^*T"^^*^ "^^^^"^ Language Instruction, olal-ASal 
Spanish Language Instruction and Non-Oral-Aural Instruction 

at ?S???n^ University of Texas 

§ne Fin^Ags!")' (Hereafter referred to%s "Year 
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Although the primary purpose of the first year was 
limited to development of reading readiness, the actual 
scope of the Project was much larger. According to Horn, 
the basic objectives were: 

(1) To break through current methods and materials 
which have resulted in pupil failure, retention 
and ultimate school -^rop-out. 

(2) To achieve a restruc turing of teacher attitudes 
toward the Spanish- speaking school beginner which 
would foster a wholesome learning environment. 

(3) To delineate in meaningful terms the forces 
affecting academic achievements by Spanish- 
speaking pupils, forces for which the schools 
may or may not be able to provide some form of 
coii5)ensatory education. 

(4) To provide a reasonably defensible research base 
for using socio-economic and/or psychological 
data to predict the level of pupil achievement 
in school. If this is found possible, schools 
would be in a position to research compensatory 
educational programs in terms of their ability 
to overcome those factors which evidence a 
negative effect on school achievement. 

(5) To obtain data which will enable defensible 
decisions to be made concerning (^) the role 
of oral language in the education of Spanish- 
speaking pupils; (b) the nature of "bi-lingualism" 
with particular reference to both positive and 
negative potentials for curriculum planning; 

(c) ways in which an educational program might 
be planned for Spanish- speaking pupils which 
would simultaneously develop cognitive and lin- 
guistic skills, using the basic content areas, 
e.g., science, social studies, mathematics, health 
and the like, as vehicles for language skill de- 
velopment; and (d) ways of enabling Spanish- 
speaking pupils to develop positive self-concept. 

(6) To identify or develop diagnostic and achieve- 
ment tests which are valid and reliable for the 
Spanish-speald.ng population. 

(7) To develop more effective and pertinent in-service 
toacher education programs for teachers of Spanish- 
speaking pupils. 



^Ibid . , pp. 7-8. 
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These objectives were maintained throughout the 
life of the Project, though limited resources prevented the 
fulfillment of each objective. The experimental treatments 
always consisted of oral language programs, but the orig- 
inal goal of developing reading readiness changed to one 
of improving reading skills. 

The Project in Year One (1964-65) was supported by 
funds mainly from the U.S. Office of Education through the 
Cooperative Research Program and miniriially from the Research 
and Development Center for Teacher Education, College of 
Education, The University of Texas at Austin. The Project 
was extended into Year Two (1965-66) and supported by funds 
from The University of Texas Research and Development Center 
for Teacher Education and by Elementary and Secondary Edu- 
cation Act (eSEA) Title I funds from the San Antonio Inde- 
pendent School District. In Year Three (1966-67) ESEA 
Title I f\mds again provided support while the R and D 
Center made possible the statistical analysis of data from 
the previous year and gathering of data for the current 
year. 

In Year Pour (1967-68) funds were provided through 
Title I and the Southwest Educational Development Laboratory 
(SEDL) in Austin. The R and D Center provided minimal funds 
for research purposes using existent data only. The data 
analyses for 1965-66 were finished and the data for 1966-67 
and 1967-68 were collected and organized. 
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In Year Five (1968-69) financial support for the 
development and implementation of the special curriculum 
was provided by SEDL and Title I and III fiinds of the San 
Antonio Independent School District. No funds were avail- 
able to The University of Texas at Austin for research 
purposes. The Project was extended into Year Six (1969-70) 
under the auspices of SEDL. 

For the first three years (fall 1964 — spring 1967) 
the e3q>erimental treatments consisted of three different 
oral language programs: intensive instruction in English, 
intensive instruction in Spanish and an approach using the 
same content but no intensive language instruction. The 
content of all three was based on lessons selected and 
adapted from Science; A Process Approach which is a series 
of exercises "designed to improve the child's skills in 
using the processes of science."^ Thus, the goal was to 
teach not only oral language but also science skills such 
as observing, classifying and measuring. This involved 
using as much as possible of the "discovery approach" to 

^Commission on Science Education, American Associa- 
tion for the Advancement of Science, Science; A Process 
Approach, Washington, D.C., 1963: Arthur H. Livermore, 
"The Process Approach of. the AAAS Commission on Science 
Education," Readings in Science Education for the Ele- 
mentary School , ed. by Edward Victor and Marjorie S. 
Lerner, The MacMillan Company, New York, 1967, p. 413. 
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teaching science. Also the pupils were to handle concrete 
objects during the activities. According to Arnold: 

This content was selected over other possible 
content areas bee; use data informally analyzed 
had suggested that disadvantaged Spanish- 
speaking children did not find the content 
more difficult than did the more advantaged 
children. Only rarely will a child of any 
background have had extensive escposure to 
science concepts and the language of science 
before entering school. Also, the language of " ' 
science tends to remain at the descriptive and 
objective level and is relatively free of the 
affective domain where differences in value 
systems and social systems may affect learning. 
Therefore, differences related to ethnic and 
socio-economic groups were assumed to be at 
a rainimvun.-'- 

Another advantage of the AAAS materials and methods 
in this situation is that with or without the language 
teaching techniques the exercises are carried out without 
pupil books. The pupils need no reading skills to par- 
ticipate in the lessons. 

The lessons used by the English and Spanish treat- 
ments have been described as being structured oral language 
lessons. Frequently this has been interpreted by observers 
to mean that the language lessons were built on some out- 
line of the syntactic structures of each language. Actually, 
according to Ott, who was one of four authors of the 



c;.n ar^^^r,^ ? *^ °- "1965-66 (Year Two) Findings, 

D^L^Jnr " Research Project, Thomas L Horn, 

Director, The University of Texas, Austin, 1968, p. 7. 
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original ciirriculum for the first and second grades^ and 
who was the supervisor of the writers for the other grades, 
there was a concentrated effort at first to systematically 
include English sentence structures. ^ For the curriculum 
in third grade and beyond the writers were instructed to 
construct sentences necessary for the teachers and pupils 
to speak as they were to go through the selected exercises 
in the AAAS materials. The English language structures 
were not arranged in any particular order such as moving 
from the simple to the complex. The Spanish materials for 
all grade levels were developed by translating the English 
lessons. This procedure cannot be defended other than upon 
grovinds of expediency. 

The term "structured" referred to the fact that the 
teachers were given rather explicit directions for con- 
ducting the lessons. This was done deliberately, according 
to Ott, because when less "structured" lessons were tried 
out in Slammer school classes preceding their first use in 
the fall of 1964, there was great variation in teacher 
performance. 



-■■David P. Butts, Elizabeth H. Ott, Albar A* Pena, 
and Anne 0« Stemmler. 

*Staff Seminar, Southwest Educational Development 
Laboratory, Austin, Texas, September, 1970. 



The sentence patterns to be learned in both the 
intensive English and Spanish sections were presented or 
modeled by they teacher and repeated by the pupils in 
dialogue situations. After introduction of new patterns 
to the whole class, the teacher then initiated dialogues 
between teachei and small groups, between teacher and 
individual children, between groups of children and be- 
tween individual children. 

The dialogue in the lesson plans involved mostly 
questions and answers, affirmative and negative. HMiy of 
the answers involved elliptical responses, e.g., "Yes, they 
do." Some substitution drills were written into the lesson 
plans. The third and fourth grade teachers were ejcpected 
to develop drills involving such transformations as the 
passive, e.g., "The dog bit the boy. The boy was bitten 
by the dog," and coordinate conjunction reduction, e.g., 
"The girl ran. The girl played. The girl ran and played." 
To the extent that the teachers were not able to produce 
their own drills, the oral language teaching techniques con- 
sisted only of modeling and repetition. 

The emphasis in the lessons was on syntax and the 
use of conplete sentences. Problems of phonology were 
treated informally and incidentally because standard English 
pronunciation was not considered one of the prime goals of 
the Project and because many of the teachers were unable 
to model standard pronunciation. 
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The group receiving intensive English instruction 
was called the OAE (oral-aural English) treatment for the 
first three years (1964-67). After that the name was 
changed to ICE (language-cognition English). 

The Spanish group was called the OAS (oral-aural 
Spanish) treatment for three yecvrs and then became known 
as LCS (language-cognition Spanish). The Spanish treat- 
ment was originally included for two reasons: 

(1) to test the effect of instruction in the 
students' native language on the develop- 
ment of oral English and subsequent reading 
in English; and 

(2) to develop standard Mexican Spanish in place 
of limited local dialects. 1 

As subsequent information in this study will show, the 
effect of instruction in Spanish was not as beneficial as 
hoped. The second reason given above is regarded as lin- 
guistically naive. Regional Spanish dialects are not 
inconplete or unsystematic. They may be "different" from 
some other dialect, but they are not "limited." 

Tke group using the science content but no inten- 
sive language practice was called the NOA (non-oral-aural) 
treatment. This group was originally intended to be a 
control group but actually became thought of as a third 
escperimental treatment. 



m ^Anne 0. Stemmler, "An Experimental Approach to the 
Teaching of Oral Language and Reading," Harvard Educational 
Review, 36:45 (Winter, 1966). 
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The major function of the NOA group was to pro- 
vide data concerning possible "halo" effects, 
e.g. , consultant attention and merely trying 
something different, caused by the Group 1 and 
2 ^AE and OASj activities and to determine the 
effect, if any, of the science materials alone 
on language and cognitive development and reading 
readiness. 1 

The NOA treatment was dropped after the first three years 
and the children were assigned insofar as possible to LCE 
and LCS classes. 

Although the lessons were designed to be used only 
for oral language activities, it is known that at some 
point some teachers began to write the sentences from the 
special materials on charts or blackboards for the children 
to read. This change from the usual basal reading instruc- 
tion was not an official component of the escperimental 
treatments, but if it occurred more in one treatment than 
in another, it becomes a factor not accounted for by the 
research design. 

Throughout the years of the Project the special 
lessons were supposed to be taught one hour per day for 
about 140 days. For the initial first grade sanple (1964- 
65) language instruction received by the OAE and OAS groups 
replaced the reguler reading readiness instruction. The 
NOA pupils received the special science instruction in 
addition to the regular reading readiness program of the 



Itorn, "Year One Pind.-'.ngs , " p. 11. 
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schcx>l district. Thus, in the Year One study, when the 
three treatments were compared for their effectiveness as 
methods of developing reading readiness, the NOA group 
really received a combination of two methods of instruc- 
tion: regular reading readiness and the informal language 
activities associated with the sciencu activities. 

Teachers in the experimental treatments received 
supervision and consultant help from The University of 
Texas at Austin or from the Southwest Educational Develop- 
ment Laboratory staff members. In-service sessions were 
intended to develop skills needed for oral language and 
science instruction and to increase xanderstanding and 
acceptance of disadvantaged Mexican-Ttoierican children. 
Teachers in the NOA treatment did not, of course, receive 
training in oral-aural teaching techniorues. 

The children who entered first grade in Year Two 
(1965-66) received basically the same type of instruction 
in the experimental treatments through Year Five (1968-69) 
as did the children who entered first grade in Year One 
(1964-65). The first, second and fourth grade materials 
were revised slightly and the third grade materials were 
completely rewritten before the second group of children 
were exposed to them. 

Starting with Year Pour (1967-68) a different set 
of treatments was begun which applied only to children who 
entered first grade that year and thereafter. The focus 
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of the research by The University of Texas at Austin fac- 
ulty and students remained on children in the grades re- 
ceiving the original treatments , however. The content of 
the new programs was ejcpanded to include social studies as 
well as science. The lessons were taught in an ESL mode 
(English as a Second Language) or bilingually. There was 
neither a Spanish-only nor a non-oral- aural treatment. 
Some classes began the use of special reading materials 
which were designed to include only words and sentence 
patterns the children had learned previously in their lan- 
guage classes. 

The original or slightly modified oral language 
materials based on science continued in use in ESL or bi- 
lingual classes in San Antonio and in other pilot test sites 
of the Southwest Educational Development Laboratory through 
the 1970-71 school year. The English version was used in 
Guam xander Ott's supervision in the 1971-72 school year. 
Elsewhere a newly written set of materials was introduced 
by the Laboratory. The language exercises were very similar 
but the effort to develop science skills was discontinued 
and much of the science vocabulary was dropped. 

YEAR Om (1964-65) FINDINGS 
As Director of Cooperative Research Project No. 2648, 
Horn reported the results of the research done during Year 
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One (1964-65).! His study will be referred to as "Year One 
Findings." His primary purpose was "to coitpare the effec- 
tiveness of three methods of developing reading readiness."' 
The three methods were the OAE, OAS and NOA treatments. 
There were nine classrooms receiving oral-aural instruction 
in English (OAE), ten classrooms receiving oral-aural 
instruction in Spanish (OAS), and nine receiving non-oral- 
aural instruction with the science exercises (NOA). 

Horn also described a control group made up of 
children from twenty-one classrooms selected to represent 
all ethnic and socio-economic status groups in the San 
Antonio Independent School District. The control classes 
did not receive the sjiecial science curriculum in any form. 
Both the NOA treatment group and the control classes re- 
ceived the standard reading readiness program used in the 
San Antonio schools. Horn did not include data for the 
control group in his analysis. 



-^Thomas D. Horn, "A Study of the Effects of Inten- 
sive Oral- Aural English Language Instruction, Oral-Aural 
Spanish Language Instruction and Nbn-Oral-Aural Instruction 
on Reading Readiness in Grade One," The University of Texas 
at Austin, 1966. See also: Thomas D. Horn, "Three Methods 
of Developing Reading Readiness in Spanish-speaking Children 
in First Grade," The Reading Teacher . 20:38-42 (October, 
1966 ) . ^ 
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The OAE, OAS and NOA groups were reported by Horn 
to have received instruction from \inits based on a Texas 
Education Agency manual designed to serve as a guide to 
teachers in planning and teaching English to Spanish- 
speaking yoTing children.^ Apparently both the CAE and OAS 
teachers used lessons prepared by the Project staff. These 
lessons were in English and involved the use of audio- 
lingual techniques. Horn said that the NOA classes also 
used instructional units based on the TEA manual. How the 
NOA teachers used these materials and maintained the non- 
oral-aviral nature of their treatment is not clear. 

Horn did not report the amount of time spent by 
each treatment group on the adapted TEA materials. The 
more each group used the materials in English and with 
audio-lingual teaching techniques, the smaller the distinc- 
tions between the three experimental groups would have been. 
At this point, it can only be assumed that the science mate- 
rials were the basis of the major part of the instruction, 
and the three treatments were indeed dissimilar. 

Over 900 children were in the experimental classes. 
Complete initial data were obtained for 735 pupils. By the 
end of the year, transfera, drop-outs and absences decreased 



-'■Texas Education Agency, Pre-School Instructional 
Program for Non-Enalish Spealcing Children . Bulletin 642. 
Texas Education Agency, Austin, 1964. 
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the nianber for whom conplete data were collected to 584 
pupils, including 316 boys and 268 girls. They were dis- 
tributed as follows; 

OAE = 186 

OAS = 204 

NOA = 194 

Horn chose the class as the sanple xanit, rather 
than the pupil, in order to control in part the teacher 
variable. With the sainple sizes as small as reported 
above, it is clear that although the class was the statis- 
tical unit, data were available for many fewer than the 
total number of children in at least some of the classes. 
(The classes were reported to have had about twenty-seven 
to thirty pupils.^) 

Because the Project was one of twenty-seven first 
grade reading studies sponsored by the U.S. Office of 
Education, the staff was escpected to use several different 
instruments to assess pupil abilities. Many of the tests 
turned out to be completely inappropriate. Horn presented 
data demonstrating the inappropriateness of the following 
tests: Wbrd Meaning and Listening subtests of the Metro- 
TX>litan Readineaa TP>.q<-« (at the beginning of first grade), 
Phonemes and Letter Names sections of the Murphv-Durrell 



■^Conversation with Sue McClellan, a 
teacher, March 20, 1972. 
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Diagnostic Readin g Readiness Teat . Pattern Copying and 
Identical Forma. ^ in eyery case there were either large 
differences between local meana and national meana and/or 
a high number of zero acorea. No appropriate teat of 
oral language ability was available at that time, which 
left unmeaaured the major thrust of the OAE and OAS 
treatroenta . 

The analyaia Horn provided uaed tw) written instru- 
ment a: the Inter- Am erican Teat of General Ability adminia- 
tered in Spaniah aa a pretest in September and the Metro- 
politan Readiness Testa. Form A, administered in English 
as a posttest in the spring. 2 Althoxigh Horn did not say, 
the Metropolitan ReadineflA scores used must have been total 
scores. The Metropol itan Readiness Testa consist of these 
subtests: Word Meaning, Listening, Matching, Alphabet, 
Numbers and Copying. Similarly, the Inter-American Test of 
General Ability scores used in the analyses must have been 



iGertrude H. Hildreth, Nellie L. Griffiths, and 
Mary E. McGauvran, Metropolitan Readiness Tests . Form 
A, Harcourt, Brace and World, inc. , New York, 1964; Helen 
A. Murphy and Donald D. Durrell, Murphy-Durrell Diagno stic 
Reading Readiness Teat . revised edition, Harcourt, Brace 
and world. Inc., New York. 1964; Pattern Copying , research 
edition released by Thelma G. Thurstone; L.ll tffiurstone 
and T.E. Jeffrey, Identical Forma , research edition. The 
Chapel^Hili? The University of North Carolina, 

^ ^Herschel T. Manuel, Prueba de Habilidad General. 
Serie Interamericana. Primario. Nivel 1. Forma DEsT research 
edition, Guidance Testing Associates, Austin, Texas, 1962. 
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total scores. This test consists of these subtests: Oral 
Vocabulary, Nuitibers, Association and Classification. 

The first main question that Horn considered was: 
Will there be significant differences among the mean 
scores of the three groups? The means of the three groups 
on the posttest were: 

Metropolitan Readiness . Form 



Group Means N 

OAE 48.19 9 

OAS 49.86 10 

NOA 54.86 9 



These means can be conpared to the national means for chil- 
dren entering first grade: 53.21.^ 

T-tests conparing each pair of means revealed no 
significant differences. This is interesting in light of 
the fact that significant differences were found in the 
pretest measure. The means for that measure were: 

Inter-American Test of General Ability^ 



Group Means n 

OAE 13.82 9 

OAS 18.71 10 

NOA 20.30 9 



•*-Horn, "Year One Findings," p. 35. 
^ Ibid . . p. 47. 
^Ibid . . p. 36. 
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According to t-tests, the differences between OAE and OAS 
and between OAE and NOA were both significant at the .02 
level. 

It was later discovered that some of the principals 
had assigned younger, less mature pupils and pupils who had 
recently arrived in the United States to the OAE group in 
hopes that they would receive special help. This violated 
the random assignment procedure called for by the experi- 
mental design and probably accounts for the lower OAE mean 
on the pretest. 

Horn's third main question asked if there would be 
any differences among the mean posttest scores (Reading 
Readiness) for the three groups if the pretest scores 
(General Ability) were held constant. Because there was 
interaction among the posttest scores when controlling for 
pretest, the null-hypothesis associated with the third 
question was not tested. 

The second main question coinpared the boys versus 
the girls instead of the treatment groups. For both pre- 
tests and posttests the differences between the means of 
the boys and the girls were non- significant (pretest: boys 
mean — 17.77, girls mean — 18.20; posttest: boys — 49.91, 
girls — 51.53). Interaction between the posttest scores 
when controlling for pretest prevented the testing of the 
null hypothesis that there would be no difference between 
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the readiness scores of the boys and girls when pretest 
scores are held constant. 

Although not part of the original statistical' 
design, Horn made several additional comparisons. He 
corrputed the posttest means for the girls and the boys in 
each class. Then he tested for significant differences 
between the highest and lowest means for: (1) all the girls 
(2) the girls in each treatment group; (3) all the boys; 
and (4) the boys in each treatment group. Except for the 
girls in the NOA treatment group, all the differences were 
significant at the .01 level. This shows that there was 
a great range in initial ability and final readiness levels 
from class to class. 

Similarly, when posttest means were figured for 
each class (boys and girls together) and comparisons were 
made of high and low means of all the groups and for each 
group separately, the differences were significant at the 
.01 level.- Horn said that it was not clear whether this 
finding was due to teacher variables or to the existence 
of atypical classes. 

Two major conclusions can be drawn from Horn's 
study. First, the experimental treatments did not produce 
differential effects in reading readiness. Therefore, 
this generalization adequately describes the outcome: 
NOA = LCE = LCS. Second, many standardized tests were 
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clearly inappropriate for the population in the San Antonio 
LcUiguage Rese?rch Project. 

YEAR TWO (1965-66) FINDINGS 
Arnold provided the results of the research done 
during Year Two (1965-66).! report will be referred 

to as "Year Two Findings." The general problem he studied 
was whether or not differential growth occurred in reading 
achievement when children receive oral language training. 

Arnold's study involved two sairples: Sanple I 
which consisted of second graders who had been studied as 
first graders by Horn and Sample II which consisted of 
first graders enrolled during 1965-66. The study of Sample 
II was intended to be a replication of Sanple I. The spe- 
cial instruction received by Year Two's first graders was 
approximately the same as Year One's first graders. The 
major differences were that some of the language lessons 
had undergone some revision and many of the teachers were 
in their second year of experience with the new techniques. 
Most of the children in the second grade experimental 
classes had been in the same type of class the year before. 

Arnold included in his data analyses the scores of 
a control group. These children were in classes "selected 

%chard D. Arnold, "1965-66 (Year Two) Findings, 
San Antonio Language Research Project, Thomas D. Horn, 
Director," The University of Texas at Austin, 1968. 
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from various schools in the district and represented a 
cross-section of socio-economic levels and ethnic groups, 
thus deviating from usual control methodology."^ 

Data from four types of tests were analyzedT These 
types and the specific tests are listed below: 

1. Language Tests 

a. Brengelman-Manning Lincmistic CaDacity ^ 
Index (first grade only.)^ 

2. Group Intelligence Tests 

a. IPAT Culture Fair Intelligence Test 

(second grade only).^ 

Thurstone Pattern Copying Teat (first 

grade only).^ 
c. Goodenouqh-Harris Drav-A-Man Test (first 

grade only).^ 



^ Ibid , . p. 5. 

^pj^^^j.^^]^ jj^ Brengelman and John C. Manning 
T:!??^^^^^^ Capacity Index, Fresno State College, Fresno, 

m 4.. /^JL?*.*^^^^^®^-'-' institute for Person ality and Ability 

Testing flPAT) Cul ture Fair Intelligence Test. Scale 1 . 

Institute for Personality and Ability Testing. Champaign. 
Illinois. 1964. tr ^ • 

^ , ^Pattern Copying, research edition released by 
Thelma G. Thurstone. ' 

^F.L. Goodenough and Dale B. Harris, Goodenough- 
garris Dray-A-Man Test. Harcourt, Brace and World, inc., 
New York, 1963. 
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Test of General Ability. Inter-American 

Series (first grade only).^ 

e« Prueba d e Habilidad General (first grade 
only). 2 

3» Reading Tests. in English 

a» Test of Reading, Inter- American Series .^ 
Metropo litan Achievement Tests (second 
grade only).^ 

Metropo litan Readiness Testa (first grade 
only). 5 

4. Reading Tests in Spanish 

a* Prueba d e Lectura. Serie Interamericana .^ 



Herschel T. Manuel, Test of General Ability. Inte r- 
American Series, Primary. Level 1. Form DE. Guidance Testing 
Associates. Auatin. Texas. 1962. 

2 

Herschel T. Manuel. Prueba Qc Habilidad General . 
Serie Interamericana . Primario, Nivel 1. Itorma CEs. Guidance 
Testing Associates, Austin, Texas, 1962. 

Herschel T. Manuel, Test of Reading. Inter- American 
Series, Primary, Levels 1 and 2, Form DE, Guidance Testing 
Associates, Austin, Texas, 1962. 

T,-,^ Walter N. Durost (ed. ) , Harold N. Bixler. Gertrude 
H. Hildreth, and Kenneth w. Lund, Metropolitan Achievement 
2es£|, Primary. Levels 1 and 2. Form A. Harcourt, Brace and 
World, New York, 1953. 

^Gerturde H. Hildreth, Nellie L. Griffiths, and 
Mary E. McGauvran. Metropolitan Readiness Tests . Form A. 
Harcourt, Brace and World, New York, 1964. 

Herschel T. Manuel, Prueba de Lectura. Serie Int er- 
^ngricana, Primario. Niveles 1 and 2. Porma CEs, Guidance — 
Testing Associates, Austin, Texas, 1962. 
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The cox*trol group was given only the Test of Itoad- 
ing. Inter-American S eries , the IPAT Culture Fair Intelli- 
gence Test, the Metropolitan Achievement Testa , the Test of 
General Ability. I nter-American Series , and the Metrotxjlitan 
Readirxess Testa . 

Given the large number of tests, it should not 
be surprising that obtaining coirplete data for all the 
children in the experimental and control classes was 
impossible. 

The number of children for whom corrplete data were 
secured and the number of classes they represented is 
given below: ^ 

Grade Two Grade One 

Treatment Group Classes children Classes Children 



OAE 


12 


178 


12 


187 


NOA 


11 


177 


11 


160 


OAS 


10 


178 


7 


105 


Control 


12 


255 


12 


178 


(Total with complete data: 


788 




630) 


(Total enrollment: 




1125 




1050) 



The purpose of the analysis was to con^are the 
reading test scores of the three experimental groups and 
the control group. 



Arnold, "Year Two Findings," p. 8. 
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Simple conparisons of means, however, were 
deemed inappropriate because of substantial 
initial differences in both reading-related 
skills and intelligence.^ 

The control groups' pretest scores were in all cases much 
higher than the escperimental groups' scores. The NOA scores 
were generally second highest. 

Arnold used analysis of covariance in order to 
equate the groups in terms of pretest scores. Because 
there was interaction between treatment and covariate (the 
regression; lines were not parallel) in some cases, analysis 
of covariance was not always possible. In the situations 
where analysis of covariance was not possible and where the 
r'ilationship between the dependent and independent variables 
was strong enough to warrant it, separate coirparisons were 
made for high and low scores. Low scores were defined as 
those below the fifteenth percentile and high scores those 
above the eighty-fifth percentile. 

The basic null hypothesis was that "the post-test 
scores most typically associated with a given pre-test 
score are equal for all treatments."^ He outlined forty- 
one different null hypotheses. When differences between 
treatments were constant throvighout the range of the 



^Ibid . , p. 13. 
^Ibid. , p. 22. 
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covariable, Arnold tested for differences between these 
groups at the second grade level: 

(1) OAE vs. OAS. 

(2) OAE and OAS pooled vs. NOA. 

(3) NOA vs. Control. 

(4) OAE, OAS and NOA pooled vs. Control. 

He made conqparisons of these groups at the first grade 
level : 



(1) 


NOA 


vs. OAS. 


(2) 


NOA 


vs. OAE. 


(3) 


OAE 


vs. OAS. 


(4) 


OAS 


vs. Control. 


(5) 


OAE 


vs. Control. 


(6) 


NOA 


vs. Control. 


(7) 


OAE 


and OAS pooled vs. NOA. 


(8) 


OAE 


and OAS pooled vs. Control 



In the main body of the report Arnold first pre- 
sented the means and standard deviations for each group at 
each grade level on each test and sxibtest. He also pro- 
vided tables which summarized these analyses: 

(1) Analysis of covariance of Test of Reading . 
Inter-American Seri^a. Metrotxplitan Achievement Testa , and 
Prueba de Lectur a, Serie Interamericana . Grade Two. 

(2) Doxible covariance analyses of Test of Reading . 
Inter-American Serif>a and Metropolitan Achievement Tests . 
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Grade Two. (The precest scores of these tests and the IPAT 
Intelligence Test scores were the co variables. ) 

(3) Analysis of covariance of Prueba de Lectura . 
Serie I nteramericana . Brenaelxtian'-Mannina Linguistic 
Capacity Index. Test of Reading. Inter-American Series , 
and Metropolitan Readiness Tests . Grade One. 

(4) Analysis of covariance of high and low pre- 
test scores of Brenaelman-Manning Linauis-'cic Capacity 
Index, Test of Reading.' Inter-American Series , and Metro- 
politan Readiness Tests . Grade One. 

A lengthy appendix includes for each criterion and 
each appropriate covariate: 

(1) the multiple correlation coefficient (squared) 
for. the full model. 

(2) the squared coefficient for the model obtained 
under the null hypothesis. 

(3) the degrees of freedom associated with the F- 
statistic obtained by coinparing the two models. 

(4) the F-ratio. 

(5) the probability of chance occurrence of full 
model values under the conditions inposed by 
the null hypo the sis. ■'■ 

Of special interest are the findings related to the 

Spanish reading test, Prueba de Lectura, Serio Interameri- 

cana. In making comparisons at both grade levels, when 

there vas no interaction between treatment and covariate, 

Arnold found that the differences between groups in all 

cases were not significant. In other words, special oral 



Ibid. , p. 79. 
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language instruction in Spanish had no differential effect 
on achievement in reading in Spanish — as measured by the 
Prueba de Lectura. Some teachers in the special language 
classes gave their children instruction in reading from 
escperience charts using the sentences from the language 
lessons, if it had been Icnown whether or not the children 
in OAS classes received this kind of instruction — reading 
in Spanish from experience charts — the data results would 
have been even miore significant. 

Because there were so many hypotheses investigated 
and so many tests used, the reporting of the results was 
long and complicated. The condensation here is taken 
from Arnold's summary. 1 

The results of the analysis of the data from Grade 
Two were fairly consistent. The differences between the 
OAE and OAS groups were significant only when the Word 
Knowledge subtest of the Metropolitan Achievement Tests was 
the criterion. This one difference favored the OAE group. 
All differences between the NOA group and the OAE and OAS 
groups together were significant, favoring the NOA group. 
When coinparisons were made between the control group and 
the OAE, OAS and NOA groups together, the scores of the 
control group generally were significantly greater, it 



Ibid . , pp. 64-66, 
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mist be remerabered that the contposition of the control 
group was different from the experimental treatment groups, 
i.e., the children represented a cross-section of ethnic 
groups and socio-economic levels, in summary, this gener- 
alization is accurate for the second grade: control 5^ 
NOA ;> CAE = OAS. 

The results of the analysis of the data from Grade 
One vrere not consistent. Moreoever, there was a marked 
tendency toward interaction between treatment and covariate; 
thus analysis of covariance was frequently inappropriate. 
Therefore, Arnold separately analyzed low and high pretest 
scores . 

When contparisons of low pretest scores were mad3 
and when the criterion was the Test of Reading, inter- 
" American Serieg, the differences were non-significant or 
favored the OAE treatment. When the criterion was the 
Metropolitan Readi ness Tests , the control group scores 
were generally significantly higher. 

When comparisons of high pretest scores were made 
and when the criterion was the Test of Reading. inter- 
American Series, the CAE group v/as significantly superior 
to the NOA and OAS groups. Differences between the OAE 
and control groups were not significant. When the cri- 
terion was the Metropolitan Readiness Tests , there were 
no significant differences between each pair of the 
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experimental groups OAE, OAS and NOA. The differences 
between the control group and each experimental group were 
either not significant or favored the control group. 

In summary, the findings for the first grade showed 
that in general when the criterion was a reading test, the 
OAE treatment was superior but when it was a readiness test, 
the control group was superior. 

YEAR THREE (1966-67) FINDINGS 
Knight presented the results of the research done 
during Year Three (1966-67).! Knight's report will be 
referred to as "Year Three Findings." As with Arnold's 
study of the Year Two (1965-66) findings, the major prob- 
lem was to evaluate the effect of oral language training 
on reading achievement. In studying this problem, he 
asked three c[uestions: 

(1) Is one treatment generally more effective 
than the other treatments? 

(2) Do children who score high, middle or low 
in the fall tend to perform similarly in the spring also? 

(3) Does the difference between treatments vary 
according to pretest scores? 



Lester N. Knight, "A Comparison of the Effective- 
^tt^ ? intensive Oral-Aural English Instruction, Intensive 
orai-Aural Spanish Instruction, and Non-Oral-Aural Instruc- 
TnS!^ Reading Achievement of Spanish-Spealcing Second- 
and Third-Grade Pupils," Unpublished Doctoral Dissertation, 
The University of Texas at Austin, 1969, ^-^ion, 
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The groups were the same OAE, OAS and NOA groups 
studied by Horn and Arnold. Knight used no control group. 

According to the description procedtires in his sum- 
raary chapter, Knight randomly selected thirty pupils in one 
treatment group for the data analysis.^ Then he chose thirty 
pupils from each of the other two groups who had pretest 
scores identical with those in the initial group. 

Knight used one group intelligence test, two English 
reading tests and one Spanish reading test. The group in- 
telligence test was given in the fall to both grades: IPAT 
Culture Fairf;TntelI iaence Test (3rd grade. Scale 2j 2nd grade. 
Scale 1). The reading tests were given to both grades in 
the fall and spring: Metropolitan Achievement Testa (Primary 

1, Form A; Primary II, Forms A and C); Test of Reading, inter- 
Am^riqan Series (Primary. Level 1. Form DE, and Primary. Level 

2, Forms CE and DE); and Prueba de Lectura. Serie Interameri- 
c^na (Primario, Nivel 1, Forma CEs and Primario. Nivel 2, 
Formas CEs and DEs). 

Knight classified the fall scores of his sai^les 
at both grade levels in two dimensions: treatment and 
level of pretests (high, medium and low). He used spring 
scores as dependent variables and 'iconducted two-factor 
analyses of variance to obtain F ratios and tests of 

llbid. , p. 201. 

2see the previous section for publication infor- 
mation for all the tests listed here. 
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significance for treatment main effect, pr(?test main effect, 
and interaction, 

Knight tested six null hypotheses for each grade 
level. The pretest and posttest scores involved in testing 
each hypothesis are listed below: 

Hypothesis Pretest Scores Posttest Scores 

1 IPAT Culture Fair Metropolitan 
Intelligence Test Achievement Tests 

2 IPAT Culture Fair Test of Reading 
Intelligence Test 

3 IPAT Culture Fair Prueba de Lectura 
Intelligence Test 

4 Metropolitan Metropolitan 
Achievement Tests Achievement Tests 

5 Test of Reading Test of Reading 

6 Prueba de Lectura Prueba de Lectura 
Each of the spring tests had three subtest scores 

and a total score, so for each hypothesis there were four 
differences to be tested. Thus there were twenty-four 
hypotheses all together. 

In answer to his first question (treatment main 
effect). Knight found that fifteen of the twenty-four pos- 
sible differences at the third grade level were significant. 
Nine favored the NOA group. Six favored the OAE group. For 
second grade, only seven of the treatment findings were sig- 
nificant. Six favored the OAE group and one the NOA group. ^ 



^Knight, "Year Three Findings," p. 128. 
2 

Ibid. < pp. 205, 208 and 209. 
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In answer to the second question (pretest main 
effect), he found that the pretest effect was highly sig- 
nificant in almost all of the analyses. In other words, ' 
children who scored high, middle or low in the fall tended 
to perform similarly in the spring.^ 

In answer to the third question (interaction), he 
found only a few significant interaction effects and they 
followed no consistent pattern. This means that if a 
certain treatment resulted in the highest scores at the 
high pretest level, it tended to do the same for the other 
levels of the pretest. ^ 

In summary, none of the significant differences 
regarding treatment favored the OAS group, even on the 
Spanish reading test, Prueba de Lectura . The significant 
treatment findings for third grade tended to favor the 
NOA group? the significant treatment findings for second 
grade tended to favor the OAE group. This latter might 
be explained by the fact that most of the second grade 
teachers had worked with the new materials and methods the 
year before. 

The results of research performed during Year 
Four (1967-68) are reported in Chapter m of this study. 



■^Ibid. , p. 202. 
^Ibid . . p. 214. 
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YEAR FIVE (1968-69) FINDINgS 
Taylor reported the results of the research done 
during Year Five ( 1968-69 ).'^ Her dissertation will be 
referred to as "Year Five Findings." she developed an 
instrument to measure "quality and quantity of productive 
language" in order to determine if the pupils exposed to 
the ICE and ICS treatments for four or five years had 
benefitted significantly in language development. ^ she 
coirpared the language of the experimental groups of pupils 
with that of the "control" children, who had received no 
special language treatment, with regard to phonology, 
intonation and fluency. 

Taylor made an effort to identify all pupils still 
in Project classrooms who had received the same treatment 
during their entire school career. Table 1 shows the 
number of pupils identified as being in continuous treat- 
ment compared to the number of children for whom coinplete 
data were collected in the 'first year. The original 
Project involved 28 classrooms in nine schools. By Year 
Five (1968-69) the students in continuous treatment were 



Thomasine H. Taylor, "The Effects of Oral-Aural 
Language Training on Gains in English Language for Fourth 
^ t^-^ Grade Disadvantaged Mexican-American Children," 
Unpublished Doctoral Dissertation, The University of Texas, 
Austin, 19o9. 

^ Ibid . . p. 8. 
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Table 1 

"""^^^rl^iif ??96rtQ?^?^ ^^'^^"^^^ Treatment Groups 
rr^^^L I J Compared to Number for Whom 

Conplete Data Were Collected in First Grade, San 
Antonio Language Research Project-I- 



Treatment 


Grade 1 Grade 5 
(1964-65) (1968-69) 


Grade 1 Grade 4 
(1965-66) (1968-69) 




Sample I 


Sairple II 


LCE^ 


186 34 


187 125 


LCS^ 


204 33 


105 29 



^ J-Compiled from data presented by Taylor, op. cit. 

p. 63; Horn, o^. cit., p. 18; and Arno^, o^. citT^ prTe! 

^LCE = Language-Cognition English 

^LCS = Language-Cognition Spanish 
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in only six schools. Taylor did not say how many differ- 
ent teachers, i.e., classrooms, were involved. 

The nxamber of pupils who received continuous treat- 
ment dropped considerably except for the fourth grade LCE 
group. Taylor did sainple randomly, however, from that 
group so that for her study the size was reduced to twenty- 
seven. The children in the control groups were selected 
from a school not in the Project but which had a population 
similar to that of the schools in the Project. Due to 
absences, transfers and inaudible recordings of some of 
the oral responses, the number of children in each treat- 
ment group and the number or controls were reduced further. 
The final number of children for whom data were collected 
and analyzed is shown below: 

Sanqole I Sample II 

(Fifth Grade) (Fourth Grade) 

LCE 32. 27 

ICS 26 20 

Control 29 27 

In order to test oral language proficiency, Taylor 
chose to take measures of phonology, intonation and flu- 
ency. She did this by using two filmstrips and recorded 
modeled sentences from two sets of Gloria and David Series 
materials used for teaching English or Spanish as a second 
language. Test 6 contained twenty pictures which the 



children watched as forty sentences were modeled on a 

tape.^ The children were asked to say the sentences 

"exactly the way" the lady on the tape said them. The 

children's sentences were recorded and evaluated later 

for both phonology and intonation. For testing fluency, 

five frames of another filmstrip were shown to the chil- 
2 

dren. Each frame was shown for thirty seconds and the 
children were asked to pretend that they were radio an- 
nouncers telling people about the pictures. They were 
requested to keep speaking until a new picture appeared. 

To measure phonology Taylor evaluated the last 
thirty-six of the forty sentences repeated on the tape 
accoir5)anying the first filmstrip. (The first four re- 
sponses were regarded as practice sentences.) She covmted 
as errors only gross deviations which would be apparent 
to a lay listener like a classroom teacher. To get a 
total score, the number of deviations was subtracted from 
616, the number of phonemes in the thirty-six sentences. 

On intonation, four points were possible for each 
sentence. Therefore, a total of 144 points was possible. 
Taylor gave two accounts for how the maximum number of 



^Gloria and David Beginning English, Series No. 20, 
Test 6, Language Arts, Inc., Austin, Texas, : ^58. 

^« ., ^Gloria and David Spanish Intermediate, Series No. 
60, Saturday's Activities," Language Arts, Inc., Austin, 



53 

points could be earned. According to one, four points were 
giv^n for a close approximation to the modeled sentence. 
In tfee other e3cplanation, she said that if it was obvious 
that the child knew what he was saying, four points were 
awarded, in a personal comm\ini cation Taylor said that 
these two criteria did not conflict. 

On intonation, a child was given three points if 
he repeated the sentence with mee.ningful intonation but 
missed special stress on a particular word. If the child 
garbled the sentence or appeared to be trying to remember 
only isolated words, he received two points, if there 
was no response, one point was given. From the descrip- 
tion of the criteria Taylor used to judge the responses, 
it is not clear what her definition of intonation was. 

To score fluency, Taylor counted the words said in 
response to the pictures on the second filmstrip. she pro- 
vided no definition of fluency and offered no rationale 
which explained why a word covmt was chosen as the measure 
of fluency. 

Taylor also computed a total language score by 
adding the other three scores together. Adding together 
raw scores to get a total sjcore may be a questionale pro- 
cedure. 

The following hypotheses were considered in the 
analysis of the data: 
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There will be no significant difference among the 
ICE. LCS and Control groups on Phonology, Intonation, 
Fluency and Total Language at the fifth grade and at the 
fourth grade levels (Hypotheses 1-8). 

There will be no correlation between the three sub- 
scales and total score: (a) Phonology and Intonation j 
(b) Phonology and Fluency, (c) Intonation and Fluency? and 
(d) each svibscale with Total Language (Hypothesis 9). 

Ar,aly&is of variance was performed to test the 
first eight hypotheses. Table 2, an adaptation of one of 
Taylor's tables, shows the order of the means for each 
group for Phonology, Intonation, Fluency and Total Lan- 
guage at each grade level. Included also is an indication 
of whether or not the differences in the means reached 
significance. 

For Saitple I (Grade Five) analysis of variance 
showed that the Phonology scores were significantly dif- 
ferent at the .01 level and the Fluency and Total Language 
scores were significantly different at the .05 level. The 
LCE treatment group received the highest mean score in 
Phonology and the LCS treatment group received the highest 
mean scores in Fluency and Total Language. In all three 
cases, the control group received the lowest mean scores. 



Table 2 



Sioimrtary of Order of Means for Phonology, 
Intonation, Fluency, and Total Lancmaae. 
Year Five (1968-69), San Antonio 
Language Research Project^ 



Saiaple I (Grade 5) 



Sasple II (Grade 4) 



Phonology 


ICZ 2 7 LCS 3 > Control * * 


LCS 


^ LCE Control 


Intonation 


LCS Control 7" LCE 


LCE 


✓'•Control-^ LCS 


Fluency 


LCS 7 LCE 5^ Control* 


LCS 


5K:ontrol-7 LCE 


Total Language 


LCS-7 LCE 7 Control* 


LCS 


5r Control :^ LCE 



**p<r.oi 



Con5)iled from Taylor's Table 16, "Year Five 
Findings," p. 110. 

2 

I^E = Language-Cognition English 
^LCS « Language-Cognition Spanish 
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For Sample II (Grade Four) none of the findings 
were significant. The ICS treatment group scores were 
highest on all tests except Intonation. 

Taylor coinp(uted an intercorrelation matrix to test 
the ninth hypothesis using the three subtest scores and 
total scores for children at both grade levels. Her Table 
9 is reproduced as Table 3. It should be noted that al- 
though all the correlations were high enough to be sig- 
nificantly different from zero, they were not necessarily 
high enough to account for a very inportant proportion of 
the variance in one score which can be attributed to 
variation in the other. 

The correlation between Total Language and Fluency 
is so high (.96) that, although the correlation is spur- 
iously high because the Total Language score includes the 
Fluency score, Taylor is justified in concluding that the 
Fluency test gives essentially the same information as the 
whole series of tests. ^ She may not be justified, however, 
in suggesting also that the Fluency test be used as a 
"quick measure of general language proficiency" because 
she has not provided an argxjment showing that her test 
does in fact measure "general language proficiency."^ 

^uinn McNemar, Psychological Statiaf -ina . 3rd. ed. , 
John Wiley and Sons, Inc., New York, 1962, p. 164. 

^Taylor, "Year Five Findings," p. 119. 
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Table 3 



elation Coefficients Coinputed on All Studen 
the Saaple (Kb=161) for Phonology. Intonatio 
Fluency, and Total Language, Year Five 
(1968-69), San Antonio Langioage 
Research Project^ 





Phonology 


Intonation 


Fluency 


Intonation 


.2213** 






Fluency 


.4198** 


.1600* 




Total Language 


.6030** 


.2870** 


.9640** 



* p^.05 
** p^.Ol 



^Taken from Taylor's Table 9, "Year Five Findings," 
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Although Taylor referred to her test as ona of 
general language proficiency, the test does not measure 
control of syntax nor the extent of each child's vocabu- 
lary. In addition, the group means for both grades on 
Intonation, ranging between 141.05 and 142.31, were so 
close together and so close to the total possible score 
of 144 that one may conclude that the measure of intona- 
tion was not very sensitive. An alternative explanation 
would be that intonation was quite good for all groups. 
Taylor did speculate in a discussion of the limitations of 
her study that the scoring system was inappropriate and 
that a better one would involve only a right or wrong 
rating. 

Observation of the data prompted Taylor to add one 
conparison to her analysis. She conpared the Grade Five 
scores on all subtests and the total with the Grade Four 
scores for each treatment group and the control group. 
Disregarding Intonation scores which have the difficulty 
previously mentioned, the Grade Five scores were higher 
than the Grade Four scores in all cases except on the 
Fluency subtest and the Ttotal Language score for the con- 
trol groups. Taylor used a t-test to see if the Fluency 
scores were significantly different and found that they 
were not. She did not determine if the Tctal Language 
scores were also significantly different. 



r 
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In discussing these findings, Taylor commented in a 
misleading way. Since no data were supplied, nor available, 
concerning the starting points of the children in the fifth 
or fourth grades as they began their school careers or at 
the beginning of the Year Five school year (1968-69), it 
might have been niore accurate for Taylor to discuss the dif- 
ferences between the fourth and fifth grade control scores 
rather than referring to a regression . 

RELATED STUDIES 

Studies of Oral Language 

Ott 

Ott's dissertation is one of several reporting the 
development and use of instruments for assessing oral lan- 
guage, and will be referred to as "A Study of Oral English." 
She stated that the intent of her study was "to determine 
levels of fluency and proficiency in the use of oral 
English, expressive operant, of Spanish-speaking school 
beginners, comparing two methods o.: instruction."^ The 
.wo methods were the OAE and NOA treatments for Grade One 
students (Sample II) in Year Two (1965-66). She went on 



Elizabeth H. Ott, "A Study of Levels of Fluency 
and Proficiency in Oral English of Spanish- Speaking School 
Beginners," Unpublished Doctoral Dissertation, The Uni- 
versity of Texas, Austin, 1967. 

2 

Ibid . . p. 8. 



/ 
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to say, however, that besides comparing the ability to speak 
(expressive operant) of the CAE and NOA treatment groups, 
she was also going to coir^jare the ability of the groups "to 
communicate in oral English in the receptive operant"^ which 
she defined as the ability to "understand verbal instruc- 
tions and to conprehend language used in basic everyday 
communication in the school environment when spoken at a 
conversational rate."^ 

Ott used the Brengelman-Manning Linguistic Capacity 
Index to assess receptive language abilities (iinderstand- 
ing). To measure productive lar^iage abilities (speaking) 
she collaborated with Jameson on the development of a new 
instrument.'* This test was referred to by several differ- 
ent names in both the ott and Jameson dissertations. Here 
it will be called the Ott- Jameson Test . 

Til® Linguistic Capacity Index is a group test de- 
signed for use with primary grade pupils whose native 
language is Spanish. The examiner asks the children to 
mark or circle the appropriate picture in a set of line 



^luxd . . p. 9. 

^ Ibid . . p. 12. 
3 

Frederick H. Brengelman and John C. Manning, Un- 
qui Stic Capacity Index, Fresno State College, Fresno, 1964 < 
Manual (ditto). Pupil Booklet (offset). 

4 

Gloria R. Jameson, "The Development of a Phonemic 
Analysis for an Oral English Proficiency Test for Spanish- 
Speakang School Beginners," Unpublished Doctoral Disserta- 
tion, The University of Texas, Austin, 1967. 
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drawings. By marking the right picture the children indi- 
cate their understanding of certain vocabulary items, 
soxind contrasts and grammatical structures. The three 
sections of the test, Vocabulary Recognition, Contrastive 
Phonology, and Contrastive Grammar, each contain twenty 
items. 

Ott began development of the Ott-Jameson Test in 
July, 1965. It was administered in a pretrial run to 
twenty subjects during the first two weeks of September. 
It was then administered to seventy-five first grade sub- 
jects. 

At the time of the September administration, the 
test consisted of three parts, in the first part the 
examiner showed drawings to each child and spoke six to 
nine sentences for each of twenty-five phonemic problems. 
The child was told to repeat each sentence. His response 
was tape recorded in order to be evaluated later. In the 
second part the child was asked to select three items from 
a group of five: doll dress, whistle, toy automobile, 
plastic wristwatch and a small cream pitcher. The exam- 
iner then asked five questions about each item. In the 
third part, the examiner showed the sxibject three photo- 
graphs (children eating and drinking, children playing on 
a playground, and children playing in groups) and asked him 
questions about each photograph « 
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After the test was administered in September, Ott 
decided to eliminate the second part since it elicited a 
limited number of measurable responses. Jameson was asked 
to revise the first part and to devise a scoring system 
for it. Jameson's work on the test is further described 
in the next section of this paper. According to Ott, the 
first part was changed only by eliminating one phonemic 
problem and adding one other. ^ This seems to be contra- 
dicted by Jameson, who reported making three separate 

revisions between the September and May administrations of 
2 

the test. Since ott did not provide a copy of the Septem^ 
ber version, no direct examination of the two versions was 
possible. Ott did not include the three iter.s related to 
the phonemic problem added to the May version in the pre- 
post coraparisons. The sentences and questions for the two 
remaining parts to be used in May were recorded on" a master 
tape; the photographs for the second part were replaced by 
color filmstrips. 

Ott included results of fifty-four sentences in the 
first part, the phonemic analysis. Each sentence contained 
one word which was judged as: (1) pronounced correctly: 
(2) pronounced with a minimum phonemic change, i.e. , with 



■Ott, "A Study of Oral English," p. 65. 
Jameson, o£. ext., pp. 19-23. 
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the predicted error; or (3) not pronounced at all, pro- 
noianced with an error not predicted or pronounced with a 
response not identifiable. Pronunciation of the other 
words in each sentence was not relevant for the scoring 
scheme. 

The second part of the Ott-Jameson Test included 
twenty-nine questions about several pictures. The ques- 
tions were classified as being on the literal level, the 
non- literal/inferential level or the imaginative level. 
This classification was not taken into account in the 
scoring syst^, however. The scoring was done by counting: 
(1) the total nxanber of responses; (2) the niairiber of re- 
sponses using Spanish intonation; (3) total word covuit 
with Spanish intonation; (4) the number of responses using 
English intonation; and (5) total word count with English 
intonation. 

Apparently, if a child answered a question by one 
word, a phrase, a sentence, or several sentences, he was 
given credit for one "response." Since intonation refers 
to stress and pitch contours, it is not clear how the term 
intonation was used with reference to individual words in 
the word count measures. 

In various places, Ott said that the second part 
of the test was intended to measure: 

(1) appropriate word choice (or vocabulary); 



(2) control of syntax (or structural features)? 

(3) intonation; 

(4) juncture: 

(5) stress; 

(6) extent of proiipt and fluent response as 
measured vdth a word count; and 

(7) fluency as evidenced by extent of prompt 
and continuous eacpression in English.^ 

It is not obvious how the method of scoring actually mea- 
sured all these aspects of language. In no place did Ott 
eacplain the relationship between her scoring system and 
what the test was intended to measure. One wishes that 
Ott had more clearly spelled out the procedures used in 
scoring the tapes for the responses to the Ott-Jameson 
Test. For instance, it is iniport.unt to know if the scorer 
was aware of the treatment group from which each subject 
came before ruling out the possibility of systematic bias 
in the data analyses. 

In addition to administering the two language tests, 
Ott also collected the IQ scores from the fall administra- 
tion of the Goodenouoh-Harris Draw-A-Man Teat .^ 



^Ott, "A Study of Oral English," pp. 10, 11, 22, 
118, 119, and 141. 

2p.L. Goodenough and Dale B. Harris, Goodenovtoh- 
Harris Draw-A-Man Test. Harcourt, Brace and Vibrld, Inc., 
New York, 1963. 
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Ott chose a small sample from four of the eight 
Project schools "on the basis of convenience. Seventy- 
five children were tested in September and fifty-eight in 
i4ay. Every child met the following criteria: 

(1) Score no lower than 70 on the Goodenough- 
Harris Draw-A-Man Test . 

(2) Age no greater than 7.0 as of September 1, 
• 1965, and no previous exposure to a first 

grade instructional program. 

(3) Little or no functional use of oral English 
as indicated by level of fluency and pro- 
ficiency as determined by the Ott-Jameson 
Test . 

(4) Present in class and physically well enough 
to participate in pre- and post-testing. ^ 

The rationale for the third criterion listed above 
is not apparent. iMoreover, one does not know how a sub- 
ject will perform until after the test has been adminis- 
tered, so the third criterion must have been applied ex 
£ost facto. Application of the third criterion appears 
to mean that children with initial strength in English 
were excluded from the sample. Thus, any conclusions 



^Ott, "A Study of Oral English," p. ii5. 
2ibid. , p. 19. 



reached about the effectiveness of the OAE and NOA treat- 
ments apply only to children who initially had "little or 
no fxinctional use of oral English. " 

The fifty-eight children in the final sairple were 
evenly distributed between the NOA and OAE treatment 
groups. In light of Ott's own conclusions and of later 
findings, it was unfortunate that she did not include also 
children from the OAS treatment group and from classes 
receiving instruction in the regular San Antonio curric- 
ulum. 

In her data analysis, ott determined correlations 
for the variables listed below. These were also the vari- 
ables to which she applied analysis of variance. 
Variable Name 

1 Correct Response ( Ott- Jameson . Part I) 

2 Minimum Change ( Ott- Jameson ; Part I) 

3 Missed ( Ott- Jameson . Fart I) 

4 Total Responses ( Ott- Jameson . Part II) 

5 Responses, Spanish Intonation ( Ott- Jameson . 
Part II) 

6 Wbrd Count, Spanish Intonation ( Ott-Jamesor . 
Part II) 

7 Responses, English Intonation ( Ott- Jameson . 
Part II) 

6 Word Count, English Intonation ( Ott- Jameson . 

Part II) 
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9 Vocabulary Recognition ( Lincruistic Capacity 

Index ) 

10 Contrastive Phonology ( Lincmistic Capacity 

Index ) 

11 Contrastive Grammar ( Lincruistic Capacity 

Index) 

12 To-cal Score ( Linguistic Capacity Index ) 

13 Goodenough-Harris IQ Score ( Draw-A-Man Test ) 
Ott first provided a table showing the Pearson 

product Moment correlations coir5)uted for the September and 
May scores of the language tests. These test-retest cor- 
relations were presented as evidence of reliability of the 
tests but the conclusions must be questioned because of 
the time lapse between September" and May, during which the 
ability being measured may have changed. In fact, the goal 
of tho experimental program was to enhance the very abili- 
ties being tested. Therefore the reliability coefficients 
were not the correlations "between two conparable measures 
of the same thing. "1 Moreover, reliability of a test 
should be established on groups different from the one for 
which the test is being used as a measure of growth. 

Ott also included three tables which gave Pearson 
Product Intercorrelation Matrices for the total san5>le, the 



■McNemar, 0£. cit . , p. 146. 



OAE group and the NOA group. The most interesting finding 
was that for the total sairple all correlations between the 
Ott- Jameson Teat variables and IQ at both September and 
May administrations failed to be significant at the .05 
level, i.e., r>.23. The highest correlation with IQ of 
a Linguistic Capac ity index variable was only .28 for 
Contrastive Phonology in Septeinber. 

A third set tif tables showed mecuis and standard 
deviations for September and May and the mean gains for 
each variable for the OAE group and the NOA group. The 
tables contained data for an extra variable: Total Word 
Count. Ott gave no explanation for this addition nor for 
how the Total Word count could be less than the sum of the 
Word Count, English Intonation and the Word Count, Spanish 
Intonation. A second item not explained was a standard 
deviation of 37.99 for Word Count, English Intonation in 
September for the OAE group while the mean was only 21.48. 

For the OAE group the trends easily seen on the 
table of means and standard deviations included a large 
increase in Correct Responses on the phonology part of the 
QtWfUHfiSQn Tfif=}t , and in Responses, English Intonation 
and Word Count, English Intonation on the fluency part of 

Ott-Jameaon Test. Thus, there were also large de- 
creases in the number of Minimum change responses. Re- 
sponses, Spanish Intonation, and Wbrd Count, Spanish 
Intonation. 
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For the NOA group there ware large increases in the 
number of Correct Responses on the phonology section of the 
Ott- Jameson Test and in Word Count, English Intonation. 
There were no large decreases. Even the Word Count, Spanish 
Intonation was greater in May. 

When one compares the word counts to th.^ number of 
responses of a particular intc. ation for both groups, it 
becomes obvious that the children must have spoken in sen- 
tences of from an average of two and a half to five words. 
The shortness of the sentences was neither commented on nor 
explained by Ott. 

Ott then used repeated measures analysis of variance 
on each variable in the language tests except for the Total 
Score on the Linguistic Capacity Index . She provided tables 
and figures for each variable giving F's and p' s for: 

(1) groups; 

(2) testing periods; and 

(3) groups across testing periods. 
In other words, she conipared: 

(1) the combined pretest and posttest scores of the 
two groups (OAE and NOA); 

(2) the combined OAE and NOA scores at the two 
administration times (May and September); and 

(3) the gains in the scores of the two groups 
between Septeinber and May. 
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Table 4 s-ammarizes these conparisons. 
All the scores between testing periods for the 
groups together (conparison #2 listed above) differed sig- 
nificantly at the .001 level except for Variable 6, Wbrd 
Count, Spanish Intonation, which differed significantly at 
the .03 level. All changes from September to May for both 
were in directions indicating growth in English oral lan- 
guage as measured by the Ott-Jameson Tf>at and the Unouistic 
Capacity Index except for one. The NOA group increased 
slightly on Word Count. Spanish Intonation. 

The scores of the two groups (DAE and NOA), col- 
lapsing over time (conparison #1 listed above), differed 
significantly only on Variable 2, Minimum Change (p = .04), 
Variable 5, Responses, Spanish Intonation (p = .02) and 
Variable 8, Word Count, English Intonation (p = .01). 

On Variable 2, Miniraxim Change, the OAE group was 
nine responses higher than the NOA group in September. 
Both groups showed a decrease in Minimum Change responses 
between September and May, but the OAE group had a much 
greater decrease (a decrease indicates inprovement in 
standard English pronunciation). The difference in gain 
scores for the two groups was significant at_the .OOi level 
(conparison #3 listed above). 

The OAE and NOA groups both scored thirteen on 
Variable 5, Responses, Spanish Intonation, in September 
but in May the OAE group dropped to five while the KOA 
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Table 4 

F-Ratios and Probability Levels for Three Conparisons 
of Ott-Jameson Test and Lincmistic Capacity Index 
Variables for First Graders in OAE (Oral-Aiiral 
English) and NOA (Non-Oral-Aiiral) Treatment 
Groups, Year Two (1965-66), San Antonio^ 

Testing Groups Across 

Groups Periods Testing Periods 

Variable P g p p F p ~ 

Ott-Jameson Test 

1 Correct ^1.00 n.s. 369.32 <.001 68.16 ^.001 
Response 

2 Minimum 4.38 .04 290.71 .001 76.58 .001 
Change 

3 Missed < 1.0 n.s. 14.28 .001 < 1.0 n.s. 

4 Total <1.0 n.s. 81.22 .001 9.03 .005 
Responses 

5 Responses, 5.61 .02 40.09 .001 21.02 .001 
Spanish 

Intonation 

6 Word Count, 1.79 >.18 5.41 .03 15.94 .001 
Spanish 

Intonation 

7 Responses, 2.73 >.09 328.95 .001 85.91 .001 
English 

Intonation 

8 Word count, 7.82 .01 54.14 .001 26.56 .001 
English 

Intonation 

Linguistic Capacity Index 

9 Vocabulary 1.19 >.20 96.18 .001 ^1.0 n.s. 
Recognition 

10 Contrastiv© 3"^. 05 70.19 .001 ^1.0 n.s. 
Phonology 

11 Contrastive <1.0 n.s. 116.71 .001 ^1.0 n.s. 

Grammar 

« ^ ^Conpiled from ott's Figures 1-11, "A Study of Oral 
English," pp. 81, 83. 84, 86, 88, 89, 91, 93, 95, 96 and 97. 
No explanation was given as to why inequality signs for F's 
and P's were used in some tables and not in others. 
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group dropped only about two responses. Again the differ- 
ence in gain scores between the OAE and NOA groups was sig- 
nificant at the .001 level (comparison #3 listed above). 

On Variable 8, Word Coiant, English Intonation, the 
two groups scored almost the same in September (twenty-one), 
but the OAE group increased to 112 in May; the NOA group 
increased only to thirty- seven. The difference in gain 
scores was significant at the .001 level (conparison #3 
listed above). 

These data for the three variables (2, 5 and 8) all 
support the contention that the experimental program results 
in greater growth in oral English than unstructured language 
activities of the non-oral-aiaral approach. 

The gain scores (conparison #3 listed above) dif- 
fered significantly also for Variable 1, Correct Responses 
(p = .001),. .Variable 4, Total Kasponses (p = .005), Variable 
6, ..^rd Covmt, Spanish Intonation (p = .COl) and Variable 7, 
Responses, English Intonation (p = .001). in all four cases, 
the gains were in directions favoring whe OAE group. 

Notice finally that on the Ott«Jameson Teat , the 
differences between the gains of the two groups (con^jarison 
#3 listed above) were significant for all variables except 
y^iable 3 (Missed~on the phonology section), while none 
of the Linguistic Capaci ty Index gain scores was signifi- 
cantly different. 
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After presenting the results of applying analysis 
of variance to the variables, Ott used the findings for 
some of the vdriables to test six hypotheses. 

The first three hypotheses were concerned with 
expressive language. 

Hypothesis Ir There will lae no difference between 
treatment groups in proficiency in language as indicated by 
the ability to reproduce English phonemes accurately from a 
model. On the basis of the results of analysis of Variables 
1 and 2, this null hypothesis was rejected. 

mnx>thesis 2: There will be no difference between 
treatment groups in proficiency in the nae of English, in- 
cluding approT>riate word chdlice, control of syntax, intona- 
tion and juncture. Using the results of the comparisons on 
Variable 7, she rejected the null hypothesis. Given that 
Variable 7 was derived by counting the number of resjionses 
using English Intonation to the twenty-nine questions in the 
second part of the Ott-Jameson Test , it is a little difficult 
to believe that Hypothesis 2 was conpletely tested. 

Hypothesis 3; There will be no differences between 
treatment groups in the extent of pronpt and fluent response 
in English. This was measured by Variable 8, Wbrd Count, 
English intonation. Anain the nxill hypothesis was rejected. 

The second three hypotheses were concerned with 
expressive lemguage. 
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Hypothesis 4; There will be no significant dif- 
ference between treatment groups in niomber of correct 
responses on the Vocabulary Recognition section of the 
Lincfuiatic Capac ity Index . The findings related to Vari- 
able 9 revealed that Hypothesis 4 could not be rejected. 

Hypothesis 5: There will be no difference between 
the treatment groups in nvimber of correct responses on the 
Contrastive Phonology section of the Linguistic Capacity 
Index. Conparisons on Variable 9 did not permit the hypoth- 
esis to be rejected. 

Hypothesis 6 ; There will be no difference between 
the treatment groups in number of correct responses on the 
Contrastive Grammar section of the Lincmistic Capacity 
Iridex. Comparisons on Variable 10 did not permit the hypoth- 
esis to be rejected. 

In summary, both the OAE and NOA groups achieved 
significant gains in eaqjressive language as measured by the 
Ott- Jameson Test and in receptive language as measured by 

Linguistic Capaci ty Index . The gains for the OAE group 
were significantly graater than those for the NOA group in 
eacpressive language, while there were no differences between 
the groups in amount of gain in receptive language. 

Ott felt that the learning es^riences provided in 
the science program used by the NOA group were an "inportant 
factor in maintaining a consistent and e<iual gain with OAE 
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in language proficiency in the Receptive Operant."^ Since 
Ott's conparisons did not include groups which did not 
receive the science curricultun, however, she did not have 
evidence showing that the "regxilar" curriculum and its 
"traditional" language activities would be any less effec- 
tive in producing gains in receptive language abilities. 

Jameson 

Jameson developed an instrument for testing the 
ability of children to produce certain English phonemes. ^ 
Her dissertation will be referred to as "Phonemic Analysis. " 
Her test was prepared with four objectives in mind: 

( 1 ) to obtain further information on the basic 
laiwfuage problems of disadvantaged pre- 
literate Spanish-speaking children in San 
Antonio public schools; 

(2) to develop an effective test for assessing 
the oral English level of these children; 

(3) to develop a test that could be used by a 
classroom teacher with no linguistic back- 
ground, following brief training; 

(4) <-o develop a test -that could measure the 
progress of the child in his discrimina- 
tion and reproduction of oral English, 
between the periods in which the test was 
administered. ^ 



^Ott, "A Study of Oral English," p. 122. 
2 

Gloria R. Jameson, "The Development of a Phonemic 
Analysis for an Oral English Proficiency Test for Spanish- 
Speakang School Beginners," Unpublished Doctoral Disserta- 
tion, The University of Texas, Austin, 1967. 

^Ibid . . p. 5. 
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These objectives can be collapsed in-* ^ one: to pre- 
pare a test siBnple enough for teachers with brief training 
to administer which could be used as a pretest for use in 
diagnosing needs of preliterate children and as a posttest 
for measuring progress. 

In order to develop the instrument and to meet her 
objfjctives, Jameson found it necessary to: 

(1) Review the literature on English tests for 
non-native speakers: 

(2) Review the literature on phonologicc.1 analyses 
of English and Spanish: 

(3) Write an outline of a contrastive analysis of 
English and Spanish: 

(4) Make three revisions of Part I of an oral 
language test first developed by Ott; 

(5) Administer each revision! of the test; 

(6) Make a phonetic and a phonemic transcription 
of twelve of the tapes of the bird revision (the phonemic 
description was used by Jameson in writing the summary 
statements for the data analyses): 

(7) Conpare her method of scoring the tests with 
scoring done by classroom teachers; and 

(8) Analyze and summarize the findings from the 
administration of the third revision. 
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Ott's version of the test was administered to sev- 
enty first grade children (Sairple II) in Project schools in 
September, 1965 (Year Two).^ It "used eight items (groups 
of pictures) to elicit the target response wrds in the 
phonemic problem areas. ott began a revision of this test 
in October but Jameson took over the tesponsibility for the 
revisions in November, ott continued to work on a test for 
other aspects of productive language. 

Jameson's first revision consisted of eighty-seven 
short minimal pair sentences which checked twenty-one pho- 
nemic problems "that had been reported by teachers, or were 
predicted for the native Spanish-speaking child. Colored 
pictures illustrating each sentence were shown to the child 
as he listened to a model saying the sentence. The child 
was to repeat the sentence. Scoring was done later by play- 
ing back the tape on which each child's voice had been 
recorded. / 

Tlie first revision was administered in January, 1966, 
to twen .y-nine first grade children in one Project school who 
had been in school for four months and to seventeen pre- 
school children who had been in school two to four days. 



^According to Ott. the nujnber of children tested was 
seventy-five. Ott. "A Study of Oral English." p. 19. 



2 

Jameson. "Phonemic Analysis." p. 18. 
^Ibid . . p. 19. 
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During administration of the test, it was further revised 
by dropping the use of the pictures - and by omitting one of 
each of the minimal pair sentences. 

The second revision contained fifty-three short test 
sentences checking twenty-five phonemic problems illustrated 
with 110 pictxires. Each sentence was modeled on a tape and 
the child was asked to repeat what he heard. There was one 
word in each sentence that was scored. The score sheet had 
places for three types of responses to be marked: 

(1) the correct word; 

(2) the word with a Spanish sound substituted; or 

(3) an unpredictable response, a response about 
which the scorer could not make a decision, 
or no resp-jnse. 

This test was administered to twenty-one preschool children 
and twenty-two first grade children in one Project school. 

Two teachers, four linguists and two native Spanish 
speakers studying the teaching of English as a foreign lan- 
guage scored one or more of the tests. All scorers agreed 
on words heard and marked aa corr-act. There was disagree- 
ment, however, on how to score the "incorrect" responses. 
This indicated need for a third revision. 

The third revision contained fifty-seven short sen- 
tences covering twenty-five phonemic problems. The sentences 
were independent of each other in meaning. A voice on the 
tape said each sentv^nce leaving a_ pause for a child io 



79 

repeat the s.ntence. The scoring system was the same as for 
the second revision. 

It if» interesting that after several more revisions, 
this test as used by the Southwest Educational Development 
La3x)ratory (SEDL) in the 1970-71 school year (under the title 
Ott Test of Oral Language ) had thirty-three sent'^nces cover- 
ing eleven problem areas. Seventeen of the sentences are 
exactly the same as the ones in Jameson's thxrd revision. 
Another one had tho word price changed to prize . None of the 
eleven problems were different from Jameson's,^ 

The SEDL scoring system had three categories for 
rating a response: correct, minimum change, and no response 
or "I don't know." The minimum change category is the same 
as Jameson's category called "the word with the Spanish pho- 
neme substituted." Then, instead of merely reporting the 
responses of each child or group according to the percentage 
in each category, the SEDL version assigns four points to a 
correct response, one point to a minimum change response and 
zero to the third category. No rationale for this method of 
scoring was given. 

Jameson conpared her scoring of the tests with that 
of thirty- four first, second, ar.d third grade teachers of 
disadvantaged Spanish- speaking children. As with the second 



Southwest Educational Development Laboratory, Ott 
Test of Oral Language. Southwest Educationil Development — 
Laboratory, Austin, Texas, 1970. 



revision, the various scorers agreed on English phonemes 
heard as clearly correct, but otherwise individual scoring 
analyses differed. Despite this, Jameson felt that her 
third objective was met. 

The third revision was administered to the follow- 
ing types of pupils: 

(1) Grade One (seven to eight months in school): 
twenty CAE pupils in San Antonio; 

twenty OAS pupils in San Antonio; 

twenty NOA pupils in San Antonio; 

twenty Control pupils in San Antonio 
(No further description provided); 

thirteen Control pupils in Austin 
(No further description provided); 

eight Anglo-American pupils in Austin 

(native English speakers attending a school 
with an enrollment of more than fifty per- 
cent Spanish-speakers); 

(2) Preschool (three months in school): 
twenty-seven pupils in San Antonio 

(No further description provided); 
fifteen pupils in Austin 

(No further description provided); 

(3) Grades Two through Six (sixteen to fifty-two 
months in school): 



fourteen pupils in Austin 

(No further description provided). 
The major parts of Jameson's dissertation include: 

(1) A contrastive outline of English and Spanish 
which supposedly could be used "as a guide to the teacher 
without a linguistic background."^ This outline appears 
to be somewhat too condensed and overburdened with new 
vocabulary to be useful to teachers as it stands. 

(2) A description of each test item of the third 
revision of the test, including: 

(a) the reason for including the itemj 

(b) likely deviations from the standard 
pronunciation; 

(c) reasons for the deviations; 

(d) data £^r responses of each group tested, 
i.e., the percentage of responses in 
each of the scoring categories; 

(e) a sxammary of the data for each item 
including, among other things, whether 
the results were those expected, and 
relating the results of each item to 
results of other similar items; 

(f ) recommendations for changes in certain 
test items* 



Jameson, "Phonemic Analysis," p. 26. 



82 

This section should be of some value to language program 
planners, especially the data on the San Antonio OAE, OAS 
and NOA groups for whom other data are available, of 
special interest are the findings that preliterate children 
do not have trouble with all the same English phonemes with 
which older children and adults exhibit problems. 

(3) A brief summary of the basic difficulties in 
oral English "discrimination/reproduction" for first 
graders after: (a) three months of school: and (b) seven 
or eight months of school. This summary includes comments 
about the actual errors in coirparison to those predicted. 
For instance, "new vowels in English created much less 
difficulty than predicted. 

Pena 

Pena's study is the only one of all the studies con- 
centrating on oral language which focuses on syntax.^ His 
dissertation will be referred to as "A Study of Syntactical 
Structures." His intention was "to ascertain the status of 
some of the syntactical structures in the oral language 
development in Spanish and English of four groups"^ of first 

^Ibid. , p. 146. 

Albar A. Pena, "A Conparative Stady of Syntactical 
Structures of the Oral Language Status in Spanish and 
English of Disadvantaged Pirst-Grade Spanish- Speaking Chil- 
dren," Unpublished Doctoral Dissertation, The University of 
Texas, Austin, 1967. ' 

3 

Ibid . , p. 2. 
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grade children in five Project schools. His study was done 
during the 1966-67 school year, which was Year Three. At 
that time some of the first grade eacperimental pupils were 
still receiving either special oral language instruction in 
Spanish or in English, but not in both. These groups were 
the OAS and OAE groups. Other escperimental children were 
receiving special instruction in science but no 3tructured 
oral language practice. These children made up the NOA 
group. The control group was at this time enrolled in the 
same schools as the experimental children, but received no 
special science or oral language instruction. Pena referred 
to this group as the NOA-NS group. 

Pena went on to say that his study would focus on 
"an intensive comparative analysis of some basic sentence 
patterns and fundamental transformations in Spanish and 
English manifested in the responses of the subjects at the 
beginning and at the end of the first grade. a secondary 
purpose was to "ascertain the usefulness of a new testing 
instrument purpcri-ing to measure basic language development . 
hoth in Spanish and English. 

The title of his dissertation and his stated inten- 
tions are somewhat misleading in that the major task Pena 
performed was to compare the four groups' initial scores and 

^Ibid . 

^ Ibid . , p. 3. 



their mean gains. These comparisons revealed some informa- 
tion about the effectiveness of tho eacperimental language 
programs, but said little about the relative or absolute 
status of the oral language structures of the children. 

In order to "ascertain the status" of some syntac- 
tic structures and to make a "comparative analysis," Pena 
collected data as follows: 

The pretest instruments, administered in September, 

1966, were: 

(1) The "Spontaneous Language" section of the 
Language-Cognition Test:^ 

(2) The Goodenouah-Harris Draw-A-McUi Test (an 
intelligence test);^ and 

(3) The Test of General Ability, Inter-American 

Series (to measxare reading readiness), English 

3 

and Spanish forms. 

The posttest instnaments , administered in April, 

1967, were: 

(1) The "Sx»ntaneous Language" section of the 
Language-Cognition Test; and 



^Anne O. Stemmler, The LCT, Lanauaae-Coanition Test . 
Research Edition (Mimeographed), Austin, Texas, 1967, pp. 1-2. 

^.L. Goodenough and Dale B. Harris, Goodengugh- 
Harris Draw-A-Man Tf^afc. Harcourt Brace and Wbrld, inc., 
New York, 1963. 

3 

Herschel T. Manuel, Teat of General Ability. Inter- 
American Seriea. Level 1, Primary, Porm CBs, Form deI Guid- 
ance Testing Associates, Austin, Texas, 1962. 
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(2) The Test of Reading. Inter-American Series . 

English and Spanish forms. ^ 
In the Spontaneous Language section of the Lanouaae- 
Cognition Test, the child had two tasks to perform. For the 
first task, the child was handed five different familiar 
concrete objects and asked to name each one and tell every- 
thing he could about it. For the second task, he was handed 
six different pictures and asked to make up a story that 
would go with the pictures or tell what he saw happening in 
the pictures. The child's responses were recorded. The 
test was given twice by the same examiner, once in English 
and once in Spanish. 

Pena could have contributed greatly to the discussion 
on scoring by presenting at least one exanple. A lack of 
clarity occurred because his description in Chapter III, 
Stemmler's discussion in Appendix A and his further comments 
in Appendix D are inconsistent. Apparently each child's oral 
responses were classified and counted. The classification 
system included twenty-two variables, which were listed only 
in the appendices. 

The first six classifications or variables corres- 
ponded to the six general sentence patterns described by 



^Herschel T. Manuel, Test of Reading. int er-American 
gges. Primary Level 1, Form ecs. Form DE, Guidance TeJti^ 
Associates, Austin, Texas, 1962, 
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StocTcwell, Bowen and Martin.^ These six patterns "allow 
the framing of an aliiost endless niunber of sentences each 
containing only a single finite verb."^ This type of de- 
scription of sentences, however, has certain disadvantages. 
For instance, "it does not reveal relationships of parts of 
sentences; it does not show how one kind of sentence may be 
related to another; and it is not extensible" and it can 
lead to a ralyses which are misleading about tmderlying struc- 
tures . ^ 

Th<i other sixteen classifications or variables in- 
cluded the transformations of negation, interrogatives , 
affirmative and negative iirperatives , passives and subjiinc- 
tivesj one-word utterances; functionally conplete sentences; 
functionally incoirplete sentences; Spanish lc«in words; 
English loan words; Hispanicized English words j non-standard 
subject-verb agreement or verb usage; adjectival usage; com^ 
pound sentences; conplex sentences and direct/indirect quo- 
tations. Pena provided no explanation for how responses would 
receive scores in these categories although some are espe- 
cially vague, such as verb usage and adjectival usage. 



Robert P. Stockwell, J. Donald Bowen, and John W. 
Martin, The Grammatical structures of English and Spanish . 
The University of Chicago Press, Chicago, 1965. 
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After the pretest data were collected, the first 
task Pena performed was a factor analysis on the frequencies 
of the various responses from the Language-Cognition Test. 
The purpose was to reduce the twenty-two variables to a more 
manageable number of scores, which were then referred to as 
the LCT factor scores. 

Pena then applied analysis of covaricjice to the LCT 
factor scores, using the Draw-A-Man Test IQ scores as the 
covariable. The purpose of this analysis was to find out if 
there were significant differences between the groups, in- 
cluding sex, on the pretest factor scores. 

After the post test data were collected, Pena per- 
formed another factor analysis on the frequencies obtained 
the Lancruaae-Coanition Teat. Then he used repeated mea- 
surement analysis of variance to determine if there were 
significant differences between the factor scores at pre- 
and posttest for each sex, group and sex group. Pena's 
analysis is somewhat limited by his failure to use factor 
matching techniques or to base his comparisons on raw data 
rather than factor scores. Factor matching techniques would 
have indicated the extent to which the factors in the two 
analyses (pre- and posttest) agreed with one another. 

The last thing Pena did was to determine the extent 
to which correlation existed between the posttest LCT factor 
scores in English and Spanish and the Test of Rgadint^ . 
Inter-American seriiaA. 
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Pena apparently never used the scores from the Test 
of Gene ral Ability. Inter- American Series which he collected 
as pretest data. 

Pena chose his subjects randomly from twenty-three 
first grade classrooms in five of the schools in the Project. 
All were native Spanish speakers. He originally chose 
twenty- two pupils in each of the four treatment groups. Due 
to attrition, he ended with sanples of sixteen from each 
group, eight boys and eight girls. The total N of sixty- 
four pupils may be too small to produce a reliable estimate 
of population variance for multivariate (factor analysis) 
techniques . 



twenty- two variables of the Languaae-Coanition Test given 
as a pretest were reduced to six LCT factors named as fol- 
lows: 



(2) Functionally Conplete Sentences; 

(3) Basic Sentences with English Loan Words; 

(4) Single Words; 

(5) Verb Usage; and 

(6) Combined Conplete and Inconplete Basic 
Sentences. 



Following is a siimmary of Pena*s results: The 



Spanish ; 



(1) 



General Sentences and Transformations; 



EnolisJi: 



(1) 



General Sentences; 



(2) 



Passive Transformations; 
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(3) Sentence Pragmentsj 

(4) Functionally Complete Sentences; 

(5) Simple Transformations; and 

(6) Lack of Negative Transformations, 

In his coiiparison of Goodenoxigh-Harris IQ scores, 
he foimd no significant differences between the four 
treatment groups. In his analysis of covariance using IQ 
as the covariable, his purpose was to discover if there were 
significant differences on the LCT factor scores among the 
groups with the boys and girls separated and between the 
boys and girls in each group. He did not look for signifi- 
cant differences between groups with the boys and girls to- 
gether. 

For the Spanish factors, analysis of covariance was 
not justified for Factor 1. He found significant differences 
among the groups with boys and girls separated for Factor 2 
(p = .001), Factor 5 (p = .05^, and Factor 6 (p = .03) and 
between the boys and girls in each group only for Factor 4 
(p a .02). He attenpted to accoiint for these differences by 
presenting a table of adjusted means for each treatment group 
divided by sex and by explaining which group (s) caused the 
major significance. 

For the English factors, analysis of covariance was 
not justified on Factors 1 and 4. He fo\ind a significant 
difference among the groups with boys and girls separated 
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only on Factor 6 (p = .032) and no significant difference 
between boys and girls in each group. 

All in all, the majority of the findings revealed 
no significant differences in the pretest scores, including 
sex, with IQ as the covariable. Pena concluded that: 

In general the groups were foiind to be rela- 
tively similar in their linguistic performance. 
This conclusion can probably be accounted for 
by the kind of scoring used.^ 

He stated further that: 

The extremely fundamental nature of the basic 
sentences and the few transformations employed 
in the linguistic categories contained the 
Linguistic Analysis Pom did not, apparently 
permit linguistic distinctions among the groups 
to be arrived at statistically. . . . Conse- 
quently, any future analysis will have to 
employ more sophisticated linguistic categories 
(einbeddings in particular, for syntax) and a 
much more in-depth sanantic analysis. 2 

The twenty-two variables on the posttest administra- 
tion of the Lanouaae -Coanition Teat were also reduced to six 
factors for both the English and Spanish versions. The LCT 
factors were named as follows: 



Spanish: 



(1) 



General Sentences and Transformations; 
Transformations and Complex Sentences; 
Single Words and Sentence Fragments; 



(2) 



(3) 



(4) 



Basic Sentences and Subjunctive Trans- 
formations; 



(5) 



Verb Usage; and 



Pena, o£. cit. , p. 109. 



^ Ibid . . p. 108. 
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(6) Single Sentences* 
English ; (1) General Sentences; 

(2) Complex Sentences; 

(3) Coirplex Sentences and Sinqple Trams- 
formations; 

(4) Basic Sentences and Complex Trans- 
formations; 

(5) Single VIbrds and Sentence Fragments; and 

(6) Passive Transformations* 

In order to perform repeated measixrement analysis of 
variance on the pre- and posttest factor scores, Pena had to 
^pply the factor weights obtained from the pretest factor 
ancdysis to the posttest variables. This was done to make 
the two sets of factor scores c^nparable* 

Then Pena tested the six English and six Spanish 
factor scores with three hypotheses: 

(1) There are no differences between the LCT 
factor scores at T^ (pretest) and T2 (posttest) for each 
sex; 

(2) There are no differences between the LCT 
factor scores at Tj^ and T2 for each group; and 

(3) There are no differences between the LCT 
factor scores at and Tj for each sex-group combina- 
tion.^ 



^Ibid . . p. 80. 



These mean that Pena was coirparing the LCT factor 
scores of all groups during pre- and posttesting to deter- 
niine if there were any differences between testing times 
as a function of treatment and sex.^ 

Pe.ia fovind that he had to accei 1 chree null 
hypotheses (p>.05) for each factor except: 

(1) Spanish Factor 3, second hypothesis (p = .02). 
A comparison of the means of the groups on the factor indi- 
cated that the mean increases over time were not the same. 
The NOA-NS group showed the greatest increase. 

(2> Spanish Factor 6, third hypothesis (p = .03). 

(3) English Factor 5, second hypothesis (p = .006). 
Again the NOA-NS group showed the greatest mean increase. 

Pena concluded from these findings that: 

. . . regardless of the treatment used the results 
obtained will be essentially the same. There 
would seem to be several e3q)lanations for this 
conclusion. First, since all the groups are being 
tataght with essentially the same program except 
for the intensive one-hour-per-day instruction in 
oral language for the CAE and OAS groups, the 
opportunities for differences in oral language 
to be esqpressed by the four treatment groups 
would probably be minimal during the first year 
of academic training. Second, it appears pos- 
sible that the oral language program in its 
present stage is not intensive enovigh in its 
coverage of the level of syntactic maturity of 
which the child of this age is capable, i.e., 
simple relative transformations, siitlple deletions, 
and so on. Third, instif f icient transfer of these 
types of sentences into other general areas, i.e., 
fields other than science and the self -concept , 
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which the Spontaneous Language Section of the LCT 
ejqplored, may have occurred. Finally, it is pos- 
sible that the syntactical analysis performed may 
r^t have had sufficient depth to encompass all the 
differences manifested in the responses of the 
subjects being tested. 

The last analysis pena performed consisted of deter- 
mining the correlations between each of the posttest Spanish 
ICT factor scores and the Spanish version of the Test of 
Reading. Inter-Am erican Series and between the posttest 
English ICT factor scores and the Eiiglish version of the 
Test of Reading , Inter-American Series . 

Two Spanish factors had significant correlations with 
the reading test: Factor 1, General Sentences and Transfor- 
mations (r = .3910) and Factor 2, Transformations and Conplex 
Sentences (r = .2157). 

Two English factors had significant correlations with 
the reading test: Factor 1, General Sentences (r = .4160) 
and Factor 3, Complex Sentences and Simple Transformations 
(r = .3789). 

Pena concluded that the first section of the Lanouaae- 
Coanition Tf^at- 

. . . did yield evidence of oral language judging 
by the considerable number of responses obtained 
m each linguistic category for each subject. . . . 
The test also proved to have great appeal for chil- 
dren, judging from the enthusiasm that was ex- 
pressed toward the items^and pictures used to 
elicit "free" responses. 2 



^Ibid . . pp. 109-110. 
^Ibid . . p. 108. 
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He also recoinmended that more complicated linguistic 
analyses be done and that programs of intensive oral lan- 
guage instruction include more sophisticated transformations 
in the first year and continue more than one hour a day. 

Studies on Reading 

Arnold 

Arnold con5)ared the scores on reading tests adminis- 
tered to the same children in the spring of second grade 
(May, 1966) and the fall of third grade (September, 1966).^ 
His sample consisted of 287 children randomly selected from 
approximately 700 children who were in Sainple I (children 
who entered first grade in the first year of the Project, 
1964-65). The tests used were Metropolitan Achievement 
Tests, Primary Level ll; Test of Reading. Inter-American 

Series , Level 2; and Prueba de Lectura, Serie Interamericana , 
2 

Nivel 2. The purpose of making the comparisons was to 
determine if any significant changes had occurred during 
the summer months and if the treatments (O/^E, OAS and NOA) 

^Richard D. Arnold, "Retention in Reading of Disad- 
vantaged Children," in Reading and Realism , ed. by J. Allen 
fifJ^^®^' ^"tematlonal Reading Association, Newark, Delaware, 



^Walter N. Durost (ed. ), Harold N. Bixler, Gertrude 
H. Hildreth, Kenneth W. Lund, Metrotxalitan Achievement Teats . 
Primary, Level II,Harcourt, Brace and World, New York, 1958; 
Herschel T. Manuel, Test of Reading. Inter-American Series , 
ffc?^^' ^' Guidance Testing Associates, Austin, Texas, 

1962; and Herschei T. Manuel, Prueba de Lectura. Serie Inter- 
americana, Primario Nivel 2, Guidance Testing Associates, 
Austin, Texas, 1962. 
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differed with respect to the magnitude or direction of the 
change. He used the t-test for differences between corre- 
lated means to test for significant differences between the 
spring and fall scores on each test for each treatment group. 
He also computed difference scores for each pupil and then 
used the t-test for differences between independent means to 
compare the means of the difference scores on each test for 
each pair of treatment groups. 

To present his results, T^nold constructed tables 

showing: 

(1) Hhe differences between the spring and fall for 
each test and treatment. (Each test had three svibtestsj the 
scores for each sxabtest and a total score were included in 
the analyses . ) 

(2) Comparison of the NOA and OAS difference scores 
for each test. 

(3) Comparison of the NOA and OAE difference scores 
for each test. 

(4) Comparison of the OAS and OAE difference scores 
for each test. 

The performance of the NOA group declined signifi- 
cantly between the spring and the fall on all three tests. 
The OAE pupils more or less maintained their position. The 
OAS group intproved its scores significantly on the Test of 
Reading. Inter- American Series and shoved gains on the 
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Arnold concluded that the results appeared to indi- 
cate that the oral-aural treatmfcnts facilitated retention 
in reading over the summer. Why the OAS group showed greater 
gains on an English reading test than the OAE group was an 
xinexplained phenomenon. 

Fowler 

Fowler's study evaluated the Brengelman-Manning 
Linguistic Capacity Tr.ri^^ as a predictor of reading achieve- 
ment as measured by the Word Knowledge, Word Discrimination 
and Reading subtest scores of the Metropolitan Achievement^^ 
Tests. ^ She tested a random sarrple of first grade students 
in eight Project schools during Year Four (1967-68). These 
pupils were in treatment groups receiving instruction with 
curricula similar to that of trie Saitple I and II classes 
except that the lessons were taught either in English or 
bilingually (English and Spanish), if the lessons were 
taught bilingually, either one teacher taught the lessons in 
both languages or a bilingual teacher and a monolingual 
(English-speaking) teacher shared responsibilities for two 
classes. In addition to the original language lessons based 
on science content, there were also oral language lessons 



^-"-f^^® °- Fowler, "The Brengelman-Manning Linguistic 

i;^ln^L^^'^^P•^^^''^!^°^°^ °^ ^^^^^9 Achievemint Sfsl^! 
ish-Speaking First Grade Children," Unpublished Doctoral 
Dissertation, The University of Texas, Austin, 1969i Walter 
^nS^S^w' f'^ ""^^2^^ Gertrude H. Hildr;th? 

SS?!v™;^^!f ^""t ^' Wrightstone, Metropolitan 

^^ir^H^ P^^"^^ I Battery, Form C, HaSourt, 

Brace and World, Inc. , New York, 1961. 
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based on social studies concepts and lessons designed to 
develop self -concept. 

The original sairple was a random selection of 152 
subjects from thirty-seven classrooms. Due to attrition, 
the final sairple size was ninety-nine students. Fowler did 
not report the number of classrooms these students repre- 
sented. 

The following data were collected to seirv^e as inde- 
pendent variables in the statistical analyses: 

(1) Scores on the Goodenouqh-Harris Draw-A-Man Test , 
administered in September, 1967. The Draw-A-Man Test is used 
to estimate intelligence. It requires no verbal responses 
from the child. 

(2) Scores on the Brengelman-Manning Linguistic 
Capacity Index, administered in September^ 1967. This test 
measures receptive language ability: Vocabulary, Phonology 
and Grammar and Total. 

(3) Total scores on the Metropolitan Readiness 
Tests, Form A, administered in January, 1968. The total 
score is a summation of the scores for Word Meaning, Listen- 
ing, Matching, Alphabet, Numbers and Copying. 

The following data were collected to serve as depen- 
dent variables: Word Knowledge, Word Discrimination and 
Reading scores of the Metropolitan Achievement Tests , ad- 
ministered in April, 1968. In the Word Knowledge subtest 
the child chooses a word to match a picture. The Word 
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Discrimination sxabtest rneasures the ability to choose the 
correct written word to match one pronovmced by the exam- 
iner. The Reading subtest measures conprehension of sen- 
tences and paragraphs. 

The following null hypotheses were tested: 

(1) There will be no significcuit correlation be- 
tween the four Linguistic Capacity Index scores and the three 
Met ropoli tain Achievement siabtest scores. 

(2) There will be no significant gain in predictive 
efficiency when the Lincruistic Capacity Index measures are 
added to information about the subjects^ intelligence, as 
measvrc.vi by the Goodenough-Harris Draw-A-Man Test , sex and 
treatment group membership. 

(3) There will be no significant gain in predictive 
efficiency when the Linguistic Capacity Index measures are 
added to the total scores of the Metropolitan Readiness Tests . 

For Hypothesis I, since there were four different 
Linguistic Capacity Index scores and three Metropolitan 
Achievement subtest scores, there were twelve different cor- 
relations to be computed. The null hypothesis was rejected 
since all the correlations were significantly different from 
zero at the .0001 level. They ranged between r = .46 for 
Word Discrimination ( Metropolitan Achievement Tests ) and 
Phonology ( Linguistic Capacity Index ) and r « .58 for Read- 
ing (Metropo litan Achievement Testa ) and the total Linguistic 
Capacity Index score. These correlations compare favorably 



^1 

with those generally found between reading readiness and read- 
ing achievement.^ 

For Hypothesis II, Fowler presented a table entitled 
"Summary of Evaluations of the Usefulness of each Brengelman- 
Manning Linguistic Capacity Index Predictor when Used with 
Sex, Intelligence, and Treatment Group Membership. "2 This 
table showed for each Linguistic Capacity Index score pre- 
dicting each criterion of reading achievement the following: 
(1) the efficiency of a prediction system containing informa- 
tion about intelligence, sex and treatment group membership 

Linguistic Capacitv lnd«:^x measure: (2) the efficiency 
of the same system without the Linguistic Capacitv Index 
score; and (3) the predictive usefulness of the Linguistic 
Capacity indpx variable. The F's were all significant beyond 
the .0005 level indicating that the Linguistic Capacity index 
scores were all significant in increasing prediction of 
Metropolitan Achie vement Teat scores. Thus the null hypoth- 
esis was rejected. The additional variances accounted for 
^ Lincfuistic Capacity Tndi:>v scores ranged from eleven 
percent for Reading ( Metropolitan Achievement Tests ) and 
Phonology (Unguisti c Capacity Tnri^^ ^ to twenty-three 



«-i^4.,-„ J^^^-^ Weintraub. "Readiness Measures for Pre- 
?iSriS^ Reading Achievement." The Reading T^arh^r . 20:551 
R^S5^;>,^-^^2' ^^""^ ^' Hillerich, "An Interpretition of 
(ApSl 1966^^ Readiness." Elementary Ennlii^h . 43:360 



^Powl 



er. 2£. sit., p. 105. 
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percent for Word Knowledge ( Metrorxplitan Achievement Tests ) 
and Grammar ( Linguistic Capacity Index ). 

To display the results of testing Hypothesis III, 
Fowler presented a table entitled "Summary of Evaluations 
of the Usefulness of Each Brengelmcin-Manning Ling\iistic 
Capacity index Predictor when Used with Reading Readiness 
Scores as Measured by the Metropolitan Readiness Tests. 
This table showed for each Lincmistic Capacity Index score 
predicting each criterion of reading achievement the follow- 
ing: (1) the efficiency of a prediction system containing 
the Metropol itan Readiness Tests total score and the Lin- 
guistic Capacity index measure; (2) the efficiency of the 
same prediction system without the Lincmistic Capacity index 
variable; and (3) the predictive usefulness of the Linguistic 
Capacity Index variable. The P's showed that six of the 
twelve usefulness valves were not significantly different 
from zero. If .05 is considered the acceptable level of sig- 
nificance, the six values significantly different from zero 
were: 

(1) MAT Reading and LCI Phonology: p = .03 

(2) MAT Word Knowledge and LCI Grammar: p = .02 

(3) MAT Reading and LCI Grammar: p =* .004 

(4) MAT Word Knowledge and LCI Total Score: 
p = .02 



Ibid . . p. 108. 



101 

(5) I-IAT Word Discrimination and LCI Total Score: 
P = .04 ^ 

(6) MAT Reading and LCI Total Score: p = .004 
Notice that the predictive usefulness of the Linguistic 
Capacity Index Total Score was significantly greater than 
zero for predicting each one of the Metroipolitan Tests sub- 
test scores. Nevertheless, the null hypothesis could not 
be rejected in its entirety. 

In summary. Fowler found: 

(1) Significant correlations between each of the 
Linguistic Capac ity Index scores (Vocabulary, Phonology, 
Grammar and Total) and three subtests of the Metropolitan 
Achievement Tests (Word Knowledge, Word Discrimination and 
Reading } . 

(2) Significant gain in predictive efficiency when 
Linguistic Capaci ty Index siobtest and total scores were 
added to information about the subjects' intelligence (mea- 
sured by the Goodenough-Harris Draw-A-Man Test ) . sex and 
treati ;ent group meriJ^ership in order to predict reading 
achievement as measured by the Metropolitan Achievement 

. Tests . 

(3) Significant gain for six of twelve possibilities 
Linguistic Capac ity Index subtest and total scores were 

added to Metropol itan Readiness Test scores in order to pre- 
dict reading achievement. 
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Fowler presented the results of her data analysis in 
a clear and straightforward manner. Addition of certain 
items would have made the study somewhat more informative. 
For instance, before listing the correlations necessary for 
testing Hypothesis I, she might have given the means and 
standard deviations on each test for the total sanple, for 
the boys, for the girls and for the members of each treatment 
group. 

Second, since she was asking about the predictive 
usefulness of the Linguistic Capacity Index in a system 
which included information about group, sex and IQ, it would 
have been useful if she had eaqjlained in what way group, sex 
and IQ each contributed to the prediction. 

Third, since treatment group membership was an im- 
portant factor, then the treatments should have been de- 
scribed in greater detail. Moreover, anyone wishing to 
replicate the study would be unable to do so, given only the 
information about the experimental treatments in Fowler's 
study. 

PaucX 

Pauck's study primarily evaluated Part II (Spanish- 
English Language Fluency or SELF-Test ) of the Ott-Jameson 
Test as a predictor of reading achievement of Spanish- 
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spealcing first grade children.^ Reading was measured by the 

Word Knowledge/ Word Discrimination, and Reading subtest 

scores of the MetroTX>litan Achievement Tests . He also 

attempted to determine the predictive effectiveness of the 

Metropo litan Readiness Tests . Form A, for that population. 

Finally he compared the effectiveness of the two predictors. 

His study will be referred to as "Evaluation of SELF-Test." 

The sanple Pauck tested was identical to the one 
2 

described by Fowler. The only difference was that Pauck' s 
final sanple size was reduced to ninety-seven, two less than 
Fowler ' s . 

The SELF-Test and the Metropolitan Readiness Tests 
were administered in January, 1968. The Metropolitan 
Achieve ment Tests were administered in April, 1968. The time 
period between administration of the instnaments to be used 
as predictors and the administration of the achievement test 
was so short that the findings may be trivial. 

Pauck tested five null hyjxDtheses: 

(1) The correlation between total word count from 
the SELF-Test and scores on the reading sxibtests of the 
Metropolitan Achievement Tests (MAT) is zero; 

^Fredrick G. Pauck, "An Evaluation of the SELF-Test 
as a Predictor of Reading Achievement of Spanish-Speaking 
First Grade Children," Unpublished Doctoral Dissertation, 
The University of Texas, Austin, 1968. 

o 

^Fowler, "Evaluation of the Linguistic Capacity 
Index." ^ f X 
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(2) The correlation between the total scores on 
Metrotaolitan Readi ness Tests and the reading subtests 

of the Metropol itan Achievement Tests (MAT) is zero; 

(3) The total word count from the SELF-Test is not 
a significant predictor of reading achievement as measured 

^ t^ Metropolitan Achi evement Tf^sts when sex., treatment and 
intelligence are statistically controlled; 

(4) The Metropolitan Readiness Tests total score is 
not a significant predictor of reading achievement as mea- 
sured by the Metropol itan Achievement Tf^ats when sex, treat- 
ment and intelligence are statistically controlled; and 

(5) There is no significant difference between total 
word count from the SELF-Test and the Metropolitan Readiness 
Tests as predictors of reading achievement as measured by 
thie Metropolitan Achievement Tests . 

Intelligence was measured by the Goodenouqh~Har ri a 
Draw-A-Man Teat, administered in September, 1967. Pauck 
included a brief description of the oral language programs 
the children participated in between January and April which 
were the "treatment" mentioned in the hypotheses. He gave 
no account, however, of the type of reading instruction being 
used. He did not say when during the year formal reading 
instruction began. This lack of information makes it unwise 
to generalize Pauck' s findings to any other population or 
situation. 



'J 
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The SELF- Test is that part of the Ott-Jameson Test 
where children are asked a series of questions about what 
they see in pictures on a filmstrip. The questions were 
said to represent three levels: literal, non-literal/ 
inferential and imaginative. 

The SELF-Test was contained in a two-track tape 
cartridge so that the questions are asked on one track and 
the child's answers are recorded on the other. Scoring was 
done by counting the words each child said in responding to 
the questions, hs Pauck mentioned, the fact that the child 
is given only a limited time to answer before another ques- 
tion is asked, means the child may be penalized if he wishes 
to think about his a;iswer before responding.^ 

Pauck reported that he had originally planned to use 
two types of scores from the SELF-Test in his data analysis: 
Word Count, Spanish Intonation and Word Count, English Into- 
nation. He found, however, that the children responded 
almost exclusively with Spanish intonation.^ This finding 
raises questions about Ott's scoring procedures in adminis- 
tering the same test two years before in the same schools. 
She found a distinction between responses using English into- 
nation and those using Spanish intonation.^ 



Ibid . . p. 151 
' ibid . . p. 16. 

'ott, "A Study of Oral English," 
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Pauck's primary method of analysis consisted of: 

. . .correlation techniques and prediction ^rom 
correlations by means of regression equations, 
incorporating testing for the significance of 
the correlation coefficient. ^ 

Although he reported in an appendix the raw scores 
of each child on each test, he did not provide means and 
standard deviations for the variables being correlated as 
McNemar suggested is appropriate for correlational studies.^ 

In testing Hypotheses 1 and 2, Pauck found these 
correlations: 

MAT MAT MAT 

Word Know . Word Disc . Reading 

SELF-Test . 627 . 597 . 657 

MetroTX)litan .598 .599 .537 

.Readineag Tests 

In tastix^r I^^thesis 5 he coopared thie SELF-Test and the 

ItetropQlitan ReAd ineas Teats as predictors and found that the 

differences between the correlation coefficient of each MAT 

subtest with the Metropolitan Readiness Tests and the SELF- 

Test were non-significant. He concluded that the SELF-Teat 

and the Metror>olitan Readiness Teats are approximately equally 

good predictors of reading achievement as measured by the 

Metropolitan Achievement Testa . 

The results of testing Hypotheses 3 and 4 showed 

that the correlations between the SELF-Test and the MAT 



^Pauck, "Evaluation of SELF-Teat," p. 19. 

^Quinn McNemar, Pavchological Statistics . 3rd ed. , 
John Wiley and Sons, Inc., Ifew York, 1962, p. 168. 
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reading subtests and between the Metropolitan Readiness 
Tests and the MAT reading subtests are "indicative of a 
relationship largely independent of sex, treatment, and 
intelligence, 

Pauck felt that, "In light of the above conclusions 
it would appear that there exists a high degree of associa- 
tion between oral English fluency and reading achievement, "^ 
This assumes, arncng other thincs, that fluency is adequately 
measured by a word count in a timed test situation. 

One possible expr.anation fox the lack of pronounced 
improvement in the reading abilities of the children' in the 
groups receiving language instructior is that the content 
of the language materials was not similar to the material in 
the basal reading programs used in the San Antonio schools. 
In order to. combat this problem, Arnold, in conjunction with 
his students at The University of Texas at Austin and the 
Southwest Educational Development Laboratory, developed some 
beginning reading lessons which had the same content as the 
oral language materials. The children read no sentence that 
they had not previously learned to produce in oral language 
lessons,^ 



•*-Pauck, "Evaluation of SELF-Test," p, 150, 
2lbid, , p, 156, 

^A more complete description of the experimental 
reading program can be foxand in Richard D, Arnold, "English 
as a Second Language," The Reading Teacher . 21:634-639 
(April, 1968), 



\ 
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Melton 

Melton used the Word Knowledge, Word Discrimination 
and Reading subtests of the Metropolitan Achievement Tests , 
Primary I Battery, Form A, administered in May, 1968, to 
compare reading achievement of three classes of first grade 
children who received instruction with the experimental 
reading materials to the reading achievement of three classes 
of children who received instruction in reading with the 
basal reading program.^ All six classes received instruction 
in the experimental oral language materials. 

Melton's study was done in Year Pour (1967-68). By 
this time the first grade classes in the Project were re- 
ceiving oral language ins tame tion either in an ESL (English 
as a Second Language) mode or bilingually. Melton did not 
say what type of oral language classes her research saitple 
represented. Nor did she have the information to describe 
the time spent in reading instanaction on a daily basis in 
each classroom. She was not able to report whether or not 
the teachers using the e3q>erimental reading materials used 
only those lessons or used basal readers in addition. 
Another unknown variable was the time of year that reading 
instruction was begun by each teacher. 



Jo Anne C. Melton, "A Comparison of Two Methods of 
Teaching Reading to Disadvantaged Mexican- American First 
Grade Children," Unpublished Master's Thesis, The University 
of Texas, Austin, 1968. 
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Melton used one-way analysis of variance to compare: 

(1) The mean scores of all the children in the 
e:q>erimentcl reading classes with those in the basal reading 
classes; 

(2) The mean scores of the classes using the experi- 
mental reading lessons; and 

(3) The mean scores of the classes using the basal 
reading program. 

On all three subtests and total reading score the 
basal reading pupils scored higher than the experimental 
reading pupils. The differences were statistically signifi- 
cant at the .05 level for all comparisons except Word Dis- 
crimination. There were significant differences at the .05 
level among the three escperimental reading classes on Word 
Discrimination and Reading sxibtests and total mean scores 
but not on Word Knowledge* Tlaere were significant differ- 
ences at the .001 level among the three basal reading groups 
on all three subtests and the total score. 

Melton concluded that if a standardized reading test 
is the criterion for success in reading achievement, "the 
basal reading program, despite great variation among class- 
rooms, is the better method to teach this group of bilingual 
Mexican-American children to read."^ She labeled this a 
tentative conclusion in light of several limitations of this 



study, among them the small n\imber of classrooms involved 
and the fact that no pretest measure to assess initial 
ability of the various classes was used in the analysis. 

The fact that the children in the basal reading 
groups performed significantly better on the Metropolitan 
Achievement Tests is not entirely a surprise since the con- 
tent of the reading tests bears much more similarity to the 
material in the basal reading program than to that in the 
escperimental reading program, cook and Arnold collaborated 
in the development of a reading test with greater content 
validity.-^ They constructed two forms of a test with a 
format similar to that of the reading sections of the Metro- 
politan Achievement Tests . 

Cook 

In the Word Knowledge subtest of the Cook-Arnold 
Reading Test, the child marked the one word from four 
written ones which best matched a line drawing, in the Word 
Discrimination subtest the child was supposed to mark the 
one word out of four which the examiner read orally. Each 
subtest included thirty- five items. According to Codk, the 
words chosen for both of these tests came from the vocabu- 
lary of the escperimental reading program. A few other words 
were used as distractors. 



Marilyn H. Cook, "The Development of a Reading Test 
for an Experimental Reading Program for Disadvantaged Chil- 
dren," Unpublished Master's Thesis, The University of Texas, 
Austin, 1969. 



The first section of the reading siibtest had thir- 
teen items in which the child chose one of three sentences 
which best described a line drawing. The second section 
consisted of eight items in riddle form. The child selected 
one of three words which best couplet ed the riddle. The 
sentences approximated those in the eaqperimental reading 
program. 

Examination of copies of both forms of the Cook- 
Arnold Reading Test included in the appendices of Cook's 
thesis showed that the tests were not professionally p\ab- 
lished and were more (difficult to read than the usual 
standardized tests. 

The Cook-Arnold Reading Test was administered in May 
to four classes using the experimental reading programs and 
four using the basal reading program. Arnold reported that 
the classes were the same ones tested by Melton.^ Unfor- 
tunately, due to loss of some of the tests, data were not 
analyzed for two of the basal reading classes. 

Cook confuted and reported item difficulties and 
.ir.em-scale intercorrelations for each item. One item had a 
Correlation below .10. Ten items had correlations between 
.10 and .2399 (Ci^ok reported this range to be between .10 
and .72 but this must have been a typographical error.) 
Eighty items had correlations between .24 and .73. Cook 



Conversation, April 21, 1972. 
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reported reliability coefficient of over .80 for each sec- 
tion of the test and of .939 for the total test score. 

Cook considered the validity and reliability of the 
Cook- Arnold Reading Test high enough to warrant its use in 
coinparing the reading achievement of the two treatment groups. 
The mean scores of the children in the experimental reading 
classes were higher than the mean scores of the basal read- 
ing classes for all the subtests and the total, but the dif- 
ference was significant only for the reading subtest (p = 
.020). This svibtest "was essentially a measure of conpre- 
hension.^mius, it appears that the prototype ejcperimental 
group was able to understand more f roia their reading than 
was the basal reading group. This is especially encour- 
aging, if the test content was valid for both types of 
reading programs, in view of the fact that the escperimental 
reading program was not complete and the teachers had to 
develop many of their own materials. 

Cook found significant differences among the mean 
scores of the classes in the esqierimental reaaing program 
for Word Knowledge (p = . 0019 ) , Word Discrimination (p ~ 
.0019), Reading (p = .0001) and Total (p = .0003). She 
found significant differences between the two classes in 
the basal reading program in Wbrd Discrimination (p = .003), 
Reading (p = .003) and Total (p ^ .00l'»; 



Ibid., p. 73. 
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Socioeconomic Factors 

MacMillan 

MacMillan's study investigated the relationships 
between certain socioeconomic factors and school achievement 
and attendance.^ He also compared the attendance of three 
ethnic groups using teirperature and precipitation as con- 
comitant variables. His report will be referred to as 
"Socioeconomic Factors." 

The following hypotheses (stated in null form) are 
those investigated by MacMillan: 

(1) The following socioeconomic independent vari- 
ables are not significant predictors of the child's school 
achievement : 

(a) occupation of father, or mother, if the 
major wage earner of the family; 

(b) size of the family; 

(c) family organization, nuclear or atypical; 

(d) sex of the child; 

(e) preschool e3q>erience; and 

(f) the child's number of days of at -^ndance 
in school; 

(2) The following socioeconomic independent vari- 
ables are not significant predictors of the child's school 
attendance : 



•^Robert W. MacMillan, "A Study of the Effects of 
Socioeconomic Factors on the School Achievement of Spanish- 
Speaking School Beginners," Unpublished Doctoral Disserta- 
tion, The University of Texas, Austin, 1966. 
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(a) occupation of the father, or mother, if 
the major wage earner of the family: 

(b) size of the family; 

(c) family organization, nuclear or atypical; 
and 

(d) sex of the child; 

(3) The following factors are not significant pre- 
dictors of the child's school attendance: 

(a) membership in the Mexican- American ethnic 
group; 

(b) membership in the Negro ethnic group; 

(c) membership in the Anglo ethnic group; 
(d. membership in upper, middle, or lower 

social group; 

(e) amoxint of daily precipitation; and 

(f ) daily tenperature.^ 

For the first two hypotheses, the saitiple consisted 
of 305 first grade pupils in the Project for whom there were 
available conplete test data and coitplete socioeconomic data. 
The study was done during Year One (1964-65). For the third 
hypothesis, the sample was drawn from five Mexican-American 
schools, four in the Project and one random; four Negro 
schools and three Anglo schools. Four of the Mexican- 
American schools and all the Negro schools were categorized 



Ibid . . pp. 14-17. 
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by the San Antonio Independent School District as being in 
the low socioeconomic class. The three Anglo schools were 
in the upper-, middle-, and lower-class categories. 

Socioeconomic data were collected from pupil ques- 
tionnaires constructed by the Project staff and filled out 
by the classroom teachers. Attendance data were collected 
from school registers and teacx.ar reports. For the third 
hypothesis, the days counted for attendance purposes were 
the same 140 days as those in which the experimental lessons 
were taught: September 21, 1964 to April 27, 1965. 

Test data were collected from the Metropolitan 
Readiness Tests, Form A; Prueba de Habilidad General. Serie 
Interamericana. Primario, Nivel 1; and the Goodenouqh-Harris 
Draw-A-Man Test. For Hypothesis 1 the achievement measure was 
obtained by converting the posttest Metropolitan Readiness 
Tests scores and the pretest Prueba de Habilidad General 
scores to standard scores and subtracting pretest and post- 
test. The rationale for this procedure was not given. The 
primary method of analysis was multiple linear regression. 
MacMillan found that: 

(1) The combination of socioeconomic variaJ^les of 
parent's occupation, family size and brganization, pupil 
sex, preschool experience and attendance are predictors of 
school achievement; 

(2) IQ scores and pretest scores were significantly 
correlated with achievement; 
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(3) Parent occupation and pupil attendance were 
equally important predictors of achievement j 

(4) Preschool e^erience did not quite reach sta- 
tistical significance (p = .0579) as a predictor, although 
it should loe considered educationally important j 

(5) Family size and organization were not predic- 
tors of achievement; 

(6) Sex as a single factor was not a significant 
predictor of achievement;^ 

(7) Individually, none of the socioeconomic vari- 
ables were important predictors of attendance;^ 

(8) Significant differences in attendance patterns 
were found among the three socioeconomic groups in the Anglo 
ethnic group; 

(9) Among the Mexican- American ethnic group the low 
socioeconomic group attended school more frequently than the 
middle group regardless of weather conditions. He specu- 
lated that this reflected the drawing power of the Project;^ 

(10) Attendance of the Negro ethnic group was nost 
affected by weather conditionsj^ 



^Ibid . , p. 222. 
^ Ibid . , p. 223. 
^Ibid . , p. 224. 
^Ibid . 

^ Ibid . , p. 225. 
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f 

(11) If attendance is an indicator of attitude, the 
Mexican-American first grade child has as positive an atti- 
tude as does the Anglo child."'" 

MacMillan also provided a review of the demographic 
and sociological literature pertaining to the Mexican- Ameri- 
can of the Southwest. In addition, a demographic description 
of Mexican-Americans in San Antonio included statistics, pic- 
tures and other information on population, income, housing 
and education. These two sections established very well the 
context in which the San Antonio Language Research Project 
operated. 

McDowell 

McDowell compared the intelligence, general ability 
and reading achievement of first grade children from three 
ethnic groups—Negro, Anglo and Spanish-surname—during the 
first year of the longitudinal study, 1964-65,2 intelligence 
was measured by the Goodenouah -Harris Draw-A-Man Test adminis- 
tered in September. General ability was measured by the Test 
of General Abilitv. Inter- Ameri can Series , administered in 
September. The Test of Re ading. Inter-American Series , ad- 
ministered in May, was used to measure reading ability. 



^Ibid . , p. 228. 
2, 



^ ^* Wc^well. "A Study of the Academic Capabili- 

ties and Achievements of Three Ethnic Groups: Anglo, Negro, 
and Spanish- Surname, in San Antonio, Texas," Unpublished 
Doctoral Dissertation, The University of Texas, Austin, 1966. 
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The children tested were in twenty-eight classes in 
nine "experimental" or Project schools and in twenty-one 
classes in twelve "control" schools. The control schools 
were selected to represent a cross-section of the San Antonio 
Independent School District. They included Negro schools, 
predominantly Spanish- surname schools, upper-class Anglo 
schools and mixed schools including children from lower- 
middle class Anglo and Spanish- surname homes. Estimate of 
social class levels was provided by personnel of the school 
district. 

The Draw-A-Man Test presumably was administered to 
all children in English. The Test of General Abilitv, Inter- 
American Series was administered in Spanish to children in 
the e3<perimental schools and in English to children in control 
schools. The Test of Reading, Inter- American Series ^ was ad- 
ministered in English and Spanish to children in experimental 
schools and in English only to children in control schools. 

McDowell designed his study to tes+" the following 
hypotheses: 

(1) There will be no difference in the pretest 
(general ability) scores among the Anglo, 
Negro, and Spanish- surname children in the 
control schools of Horn's study. 

(2) There will be no difference in the posttest 
(reading achievement) scores among the Anglo, 
Negro, and Spanish- surname children in the 
control . schools when the related factors of 
age, intelligence, pretest score, and school 
attendance are held constant. 
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(3) The children in the control group who are 
repeating first grade will not demonstrate 
significant gains in reading achievement 
over matched groups of nonrepeaters from 
the control and experimental schools. 

(4) There will be no significant difference among 
the posttest scores (reading achievement) of 
Negro children, Anglo children, and the 
Spanish-surname children of the control schools 
and the children in the experimental schools 
when th^^elated factors of age, intelligence, 
pretest score, and school attendance are held 
constant. 

He also measured the: 

. . . differences between pretest (general ability) 
and posttest (reading achievement) scores when the 
children of the control and e3q>eriinental schools 
were grouped according to socioeconomic level. 2 

In testing the first hypothesis, McDowell foxind that 
the Anglo children scored higher than either the Negro or the 
Spanish children in all areas of the pretest scores. Sig- 
nificant differences occurred between the scores of the 
Spanish-surname children and the Negro children on four vari- 
ables. The differences on the oral-vocabulary subtest of the 
Test of General Ability. Inter-American Series favored the 
Negro children. The differences on the association subtest of 
the Test of General A bility, Inter-American Series and on both 
the raw scores and standard scores from the DrawA-Man Test 
favored the Spanish-surname children. 



Ibid., pp. 8-9. 
^ Ibid . . p. 9. 
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Because of interaction among the groups on three of 
the covariables to be considered in testing the second hy- 
pothesis, no statement could be made "concerning significance 
of differences among groups for the continuous range of 
scores on the criterion variable*"^ This same difficulty 
also prevented meaningful comparisons from being made as 
planned in the fourth hypothesis above • 

In testing the third hypothesis, Spanish-surname and 
Anglo repeaters in the control schools and nonrepeaters in 
the experimental and control schools were matched by standard 
IQ scores. Scores of three groups were then compared: 
repeaters in control schools, nonrepeaters in control schools 
and nonrepeaters in eaqperimental schools* When the total 
group was the sample, the repeaters scored significantly 
higher (.05) than the nonrepeaters on two of the subtests on 
the pretest, Test of General Ability. Inter->American Series . 
On the post test. Test of Reading, Inter-American Series , no 
group scored significantly higher than any other. 

When the scores of Spanish-surname children in the 
control schools were analyzed separately, the repeaters were 
found to have scored significantly higher (p « .001) on the 
pretest but there were no significant differences at the .05 
level on the poattest. When the scores of the Anglo chil- 
dren in the control schools were analyzed separately, the 



Ibid . . p. 103. 
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differences between the repeaters and nonrepeaters were not 
significant on either the pretest or posttest. 

McDowell also grouped t^e children according to 
socioeconomic class and compared their scores on two sxib- 
tests (oral-numerical and association-classification) of the 
pretest, the Test of General jOpilitv. Inter-American Series , 
and on total comprehension-vocabulary of the posttest, the 
Test of Reading . Inter- American >eries . The control classes 
were designated as upper-class, middle-class, middle- lower 
class, or lower- lower class. The pupils in the experimental 
schools, all considered to be lower-lower class, were kept 
separate for this analysis. For each of the three tests, 
McDowell presented a table showing the mean scores, the dif- 
ference between the means, the variances and the difference 
between the variances for each social class. The rank order 
of the groups according to the mean scores for both pretests 
was: middle- lower class, upper-class, lower- lower class, 
middle-class and experimental group. On the posttest the 
rank order remained the same except that the middle- lower 
class dropped from first to fourth place. McDowell did 
discover, however, in looking at the relationships between 
each pretest variable and the posttest variable, that the 
scoring pattern of the middle-lower class was different from 
that of the other four groups. As it turned out, all the 
children representing the middle- lower class were Negroes 
from the same school. 
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The conparison of the scores of the various social 
classes revealed several points of interest according to 
McDowell : 

(1) The children in the escperimental classes 
scored significantly lower as a group on 
both pretest and posttest than the children 
in any of the socioeconomic groups of the 
control schools. 1 

(2) The children in the upper class appear to 
achieve greater gains in reading than chil- 
dren of the other groups. 2 

(3) It is possible to conclude that there are 
definite differences among the socioeconomic 
groups in their scores on the different sec- 
tions of the two tests in question. 3 

McDowell performed a series of analyses to supplement 
those which proved inappropriate for testing Hypotheses 2 and 
4 due to interaction among the groups. First, he planned to 
use analysis of covariance to conqpare the scores of the chil- 
dren in the three ethnic groups in the control schools except 
for the Negro children representing the middle-lower class. 
The pretest, the Test of General Ability. Inter-American 
Series, was the independent variable and the Test of Reading . 
Inter-American Series was the criterion variable. Age, 
intelligence and attendance were held constant. This time no 
significant interaction among the groups was found so analy- 
sis of covariance was appropriate. No significant differences 

^Ibid . . p. 127. 
^ Ibid . . pp. 127-28. 
^Ibid . . p. 128. 
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were found on the criterion scores among the three ethnic 

groups in the control schools. 

Second, he repeated the first analysis except he also 

dropped out the scores of the Spanish- surname repeaters. 

Again he found no significant differences on the criterion 

scores among the three ethnic groups. Because of the results 

of the first two analyses, he conjectured that: 

. • . significant differences in group scores are 
occasioned by an unequal distribution of students 
at the different ends of the continuous scale of 
values (whatever the values: e.g., age, intelli- 
gence, pretest).-*- 

Third, he intended to compare the scores of the chil- 
dren in the controJ. classes, minus the middle- lower class 
children, with the scores of the children in the e3q>erimental 
schools. Significant interaction prevented this analysis 
from being coitpleted. ' 

Last, he was going to repeat the third analysis ex- 
cept that the scores of the Spanish- surname repeaters would 
be excluded. However, again significant interaction found 
among the groups prohibited conpletion of the analysis. 

In conclusion, McDowell noted that although differ- • 
ences were found among the ethnic groups in the control 
schools in the pretest and the posttest scores, limitations 
of the study prevented anything to be said about the 



^ Ibid . , p. 156. 
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differences except that "they do exist and the effect on 
schooling of the groups is not consistent."! 



Additional stud-:«:^c; 



studies by Bradley, Bussey and Comejo were also 
done as part of the San ^tonlo Language Research Project. 
All related to oral language of children.2 The studies by 
Bradley and Bussey involved .^re extensive use of the Gloria 
asd.£avid sentence repetition test first used by Taylor. 3 



^Ibid. . p. 148. 



^-^e^o^llielly^'^l'^ ^^^t^- Oral 

Children of a French ?inm,?!?'^T, ^ ^''^"P Fourth Grade 
Master's Shetis^^e S^J^^sfiS ^J^J^^P^^, " Unpublished 
N. Puth Bradley "A Two-v^f^ Austin, 1970j 

of Oral Langufgi Pro^TcIencJ fS^f °^ Relation 

First Grade Chi!dSfiiin^p?Sl?^?^"5.^^«v«"ent of 

Unpublished Doctoral SissertfJfSn JS''^?^^^ Backgrounds- 
Austin, 1971; Jo SslafBuiS' ^? University of Texas, 

Phonological Vari^LS^ng If^^i Se^S'^^^j^SK?^!^*^^ 
Linguistically Different^ilrfrof 7' and Third Grade 

Schools," Unpiblishid SSct^af Sflserfir ^^Jf'^^^^o (Texas) 
of Texas, Austin, 1971; SicSdo rft^ ^® University 

A Study of the lexicon o^^Se^iJlfJfJ^S?^^?' "^ili^g^aliL: 
Children of Texas," Unpublished JSZ;T°i^ Spanish-Speaking 
University of Tex^s, ASSiifgg^'^^^^^l Dissertation, The^ 



^Taylor, "Year Five Findings." 
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SUMMARY 

Beginning in the fall of the 1964 a group of Mexican- 
American "disadvantaged" children in a research project in 
San Antonio, Texas, received instruction in one of three 
treatments: special English language instruction using oral- 
aural techniques with science content, special Spanish lan- 
guage instruction using oral-aural technicjues with science 
content, and instruction with the same science content but 
with no special language instruction. These treatments were 
known respectively as the OAE (oral-aural-English) , OAS 
(oral- aural- Spanish) and NOA (non-oral-aural). After three 
years the NOA treatment was discontinued and the labels of 
the first two were changed to LCE (language-cognition English) 
and LCS (language-cognition Spanish), other changes were made 
in the treatments which affected only the children who entered 
first grade in 1966 or later. 

The eacperimental treatments began when the children 
were in first grade and were continued as they proceeded 
through elementary school. Most of the research efforts of 
the staff and students at The University of Texas at Austin 
had their focus on the effects of the treatments on the chil- 
dren who entered first grade in 1964 and 1965. The research 
sanple consisting of the children who entered school in 1964 
was known as Sample I. The children who entered school in 
1965 made up Sample II. 
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Horn found that after one year of treatment there 
were no significant differences among the scores of the OAE, 
OAS and NOA groups in Sample I in reading readiness. 

Arnold studied Sample I after tvra years of treatment 
and Sample II after one year. He coirpared the test scores 
of the three treatment groups and a control group selected 
from a cross-section of San Antonio Schools. The scores 
were taken from several tests of language, group intelligence, 
English reading and Spanish reading. In Sanple I Arnold 
found that generally there were no significant differences 
between the OAE and OAS groups, that the NOA group scores 
were significantly higher than the OAE and OAS groups and 
that the control group scores were highest. Although the 
results for Sample II were inconsistent, it was possible to 
generalize that the OAE treatment was superior in reading 
achievement and the control group was superior in reading 
readiness. 

Knight studied Sample I after three years of treat- 
ment and Saitple ii after two years. His analysis included 
no control group. He compared English and Spanish reading 
scores using analysis of covariance with pretest scores and 
group intelligence test scores as the co variables. The re- 
sults of the comparisons were not consistent but the findings 
for Sample I tended to favor the NOA group and for Sanple II 
the OAE group. 
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The research concerning the results after the fourth 
year is reported in Chapter III, which follows. For the 
Year Five study, Taylor constructed an oral language test 
which was intended to measure phonology, intonation ctnd 
fluency in English, she used the test to assess the language 
of children who had been in LCE or LCS classes for five years 
(Sanple I) and four years (Sair5>le II) and of children who 
had never received special language instruction. For Saxrple 

I the LCE group received the highest mean score in Phonology. 
The LCS group received the highest mean scores in Fluency and 
Total Language. The control group received rhe lowest mean 
scores. For Sample II none of the differences was signifi- 
cant. 

Ott compared the English language abilities of Saxiple 

II children in the OAE and NOA groups when they were in first 
grade. 3he used the Brengelman-Manning Linguistic Capacity 
^^dex to measure receptive language, i.e., ability to under- 
stand oral language: she constructed her own test to measure 
e:»q>ressive language, i,e. , ability to speak. She concluded 
that both groups made significant gains in receptive and 
expressive language. The gains of the OAE group were signifi- 
cantly greater than those of the NOA group in e3q>ressive 
language but there was no significant difference between the 
groups in amount of gain in receptive language. 
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Jameson constructed a test for' assessing children's 
ability to produce certain phonemes. She administered the 
test to some children in sanple II while they were in first 
grade and to other Spanish and non-Spanish-speaking children 
from preschool through sixth grade. Jameson's contribution 
consisted mainly of a contrastive outline of English and 
Spanish phonology, the instrxament for assessing language 
and a summary of the data collected when the test was adminis 
tered. She attempted no evaluation of the escperimental treat 
ments. 

Pena attempted to evaluate control of syntax in the 
oral English and Spanish of first graders. The instrument 
he used was the research edition of a test developed by 
Stemmler. The children tested entered first grade a year 
later than the children in Santple II. The experimental treat 
ments, however, were approximately the same as in previous 
years: OAE, OAS and NOA. A fourth group (NOA-NS) consisted 
of children in the Project schools who had not received spe- 
cial language or science instruction. Pena administered the 
test in September as a pjretest and in April as a posttest. 
He found in general that there were few significant differ- 
ences among th.e four groups on either the pretest or the post- 
test. 

Arnold tested reading achievement of children in 
Sanple I at the end of second grade and the beginning of 
third grade. He found that over the summer the NOA group 
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scores declined significantly, the OAE group more or less 
maintained its position and the OAS group improved its 
scores* 

Fowler administered the Brengelman-Manning Linguistic 
Capacit y Index , the reading tests in the Metropolitan Achieve- 
ment Tests , and the Metropolitan Readiness Tests to a sairple 
of first graders who entered school in 1967. These children 
received special oral language instruction either in English 
or in both English and Spanish. Fowler fovmd significant 
correlations between each of the Linguistic Capacity Index 
scores and the three subtests of the Metropolitan Achievement 
Tests. She also found gains in predictive efficiency when 
Linguis tic Capacity index scores were added to information 
about IQ, sex and treatment in order to predict reading 
achievement as measured by the Metropolitan Achievement Tests > 
Finally, she foxind significant gains in predictive efficiency 
in six of twelve possibilities when Linguistic Capacity Index 
scores were added to Metropolitan Readiness Teats scores to 
predict reading achievement. 

Pauck analyzed scores of the same r^hildren as in 
Fowler's sample except that ae used the fluency section of 
the oral language test Ott designed instead of the Linguistic 
Capacity Index. He counted words in the children's responses 
to certain questions to obtain scores for the oral language 
test. He found that the oral- language test and the Metro- 
politan Readine ss Tests were equally good predictors of 
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reading achievement as measured by the Metropolitan Achieve - 
ment Tests . 

Melton administered the reading sections of the 
Metropolitan Achievement ta.s^., to six classes of first grade 
children receiving the experimental treatments who entered 
school two years later than the children in Sample II. 
Three of the classes received reading instruction in a basal 
reader program and the other three classes used experimental 
reading materials based on the special oral language treat- 
ments. She found that the basal reading pupils scored sig- 
nificantly higher than the experimental reading pupils. 

Cook and Arnold constructed a reading test with the 
same format as the Metropolit an Achievement Tests , but based 
on the content of the experimental reading program. Cook 
administered the tests to eight first grade classes including 
the same ones in Melton's research san^jle. On this test the 
experimental reading pupils scored higher than the basal ' 
reading pupils but the difference was significant on only 
one subtest. 

MacMillan investigated the relationships between cer- 
tain socioeconomic factors and school %chievement and atten- 
dance of pupils in Sample I during their first grade year. 
He also collected data on first grade pupils of other ethnic 
groups and socioeconomic classes in the San Antonio school 
district. He found, among other things, that a combination of 
socioeconomic variables was a predictor of achievement and 
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sex as a single factor was not a significant predictor. He 
discovered significai.t differences in attendance patterns in 
the three socioeconomic classes in the Anglo ethnic group. 
He also found that Negro group attendance was nost affected 
by weather conditions. 

McDowell compared the intelligence, general ability 
and reading achievement of first grade children from three 
ethnic groups and several socioeconomic classes. For some 
of his analyses he included children from Sample I. McDowell 
found significant differences on pretest and posttest scores 
among the three ethnic groups in the control schools. He 
also found that the children in the experimental classes re- 
ceived significantly lower scores than those of children in 
any socioeconomic group in the control schools. 

The studies reviewed here had several methodological 
weaknesses in common. For instance, none of them included 
adequate descriptions of the ejcperimental treatments. How- 
ever, since the studies were related, sample lesson plans 
were only provided in the appendices of one or two which 
gave a clear idea of the special curriculum. Information 
such as the nature of the supervision and inservice training 
was not described in detail, in addition, none of the 
authors gave a candid explanation of the procedures used 
in developing the curriculumi materials for any of the grade 
levels. 
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None of the writers described the regular San Antonio 
curriculum. Thus, any conparisons of the experimental treat- 
ments with .he regular curriculum cannot be interpreted mean- 
ingfully. Uncertainty as to the time teachers actually de- 
voted daily to the experimental treatments alto inhibits 
accurate interpretation of results. 

Several of the studies displayed an unsophisticated 
use and understanding of statistics. Dissertations involving 
correlation did not clearly indicate that the authors under- 
stood the difficulties inherent in such studies. Also, some 
remarks reflected confusion with regard to the distinction 
between correlations high enough to be useful and correla- 
tions high enough to be statistically different from zero. 
In several studies, readers are handicapped by inconplete 
presentations of data. For instance, in one correlation 
study and in one study involving analysis of covariance, the 
means and standard deviations .for each group on each variable 
were not reported. These weaknesses in statistical matters 
may be at least part of the cause for another disappointing 
feature of many of the studies. The discussion sections 
found in the concluding chapters were frequently quite brief. 
The reader is given too little help in interpreting the data. 

Several weaknesses in research design affect the 
conclusions that can be drawn from the lesults of several 
studies. An unknown number of subjects in the Year Two 



133 

(1965-66) and Year T-hree (1966-67) studies did not receive 
the same treatment for the entire two or three years they 
were in the Project. The attrition rate was so great that 
the Year Four (1967-68) study, which is contained in Chapters 
III and IV of this dissertation, and the Year Five (1968-69) 
study both suffer from very small sample sizes contJared to 
the original number of pupils in each treatment. The results 
of a number of still other of the studies would have been 
more revealing if appropriate control groups had been in- 
cluded in the research design. 

Evaluation efforts throughout the life of the Project 
were handicapped by a lack of tests for assessing various 
aspects of oral language in English and Spanish. This lack 
was, of course, the reason why several dissertations consisted 
of attenpts to develop appropriate tests. The authors of 
these oral language instruments could have made even greater 
contributions, however, if they had provided conplete ration- 
ales for the methods they used in constructing the tests. 
The authors did not establish firm connections between the 
aspect of language supposedly being measured and the measure 
itself. In other words, the validity of the tests is ques- 
tionable. The methods of evaluating responses were not always 
explained clearly, in addition, the authors did not furnish 
rationales for the scoring systems they used. A few of the 
authors of the oral language teats did not describe the con- 
ditions under which their teats were administered, the 
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procedures used in the testing and the procedures for scoring 
the responses. 

The deficiencies in research methodology probably 
reflect two major types of problerr.3: (1) those associated 
with attempting longitudinal research in the public schools 
without strong commitment and sustained cooperation from the 
school personnel in a project conducted by a small staff sup- 
ported by a limited budget; and (2) those to be expected in 
research carried out by individuals who lack extensive train- 
ing in research methodology and statistical techniques. 



CHAPTER III 



DESCRIPTION OF PROCEDURES AND 
RESULT3 OF DATA ANALYSES 

This study was designed to evaluate the effective- 
ness of various experimental treatments of the San Antonio 
Language Research Project at the end of Year Four (1967- 
68). Scores from the Iowa Tests of Basic SleLlls and The 
Lorge-Thomdike Intelligence Tests were analyzed for third 
and fourth grade children who had received: 

(1) Intensive oral-aural instruction in English, 
with science content. This group will be referred to as 
the ICE (Language-Cognition English) group. 

(2) Intensive oral-aural instruction in Spanish, 
with science content. This group will be referred to as 
the LCS (Language-Cognition Spanish) group. 

(3) Instruction in science content but without 
oral-aural language teaching techniques for two years and 
intensive oral-aural instruction in English for one year. 
This group will be referred to as the NOA-LCE group. 

(4) Instruction in the usual San Antonio curric- 
ulun- with no experimental curriculum or special inservice 
activities. This "control" group will be referred to as 
the SAC group. 
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The remainder of this chapter relates in detail the 
operational and statistical procedures used and the results 
of the data analyses. 

DESCRIPTION OF PROCEDURES 
Population 

Information in Chapter I established the economic 
and educational plight of Mexican-Americans. Dissertations 
by MacMillan and Bussey provide information describing the 
pupils who attend Project schools and their families.^ In 
suinmary, almost all of the children in the Project schools 
are Spanish- speaking Mexican-Americans of low socio-economic 
status. 

Sampling Procedure 
Despite the fact that between six and eight hundred 
pupils were in experimental classes in each grade in each 
year of the Project, the san^jle sizes for research purposes 
have been smaller than 200 for each treatment. Complete 
data for every child was difficult to secure, partly because 



^Robert W. MacMillan, "A Study of the Effect of 
Socioeconomic Factors on the School Achievement of Spanish- 
Speaking School Beginners," The University of Texas, Austin, 
1966; Jo Ann Keslar Bussey, "A Comparative Study of Phono- 
logical Variations Among First, Second, and Third Grade 
Linguistically Different Children in Five San Antonio 
(Texas) Schools," The University of Texas, Austin, 1971. 

i 
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of transfers and absenteeism of pupils and partly because 
of teacher attrition. Table 5 shows the number of pupils 
in experimental classes during the first three years and 
the number of subjects in the data analyses. 

By Year Four (1967-68), it became apparent that a 
large number of children were not being promoted with their 
classmates or were being assigned to non-experimental classes 
or to classes receiving different treatments from the one 
they received in first grade. The intention of this study 
was to evaluate the long-run effectiveness of the experi- 
mental treatments. Therefore, it was deemed appropriate to 
include in the sample only pupils who had received the same 
type of instruction for their entire school career and who 
had not failed a grade. (The niomber who failed was very 
small.) Naturally, limiting the sample in this way would 
be one more factor reducing the sample size. 

In order to identify the children who had always 
been in the same treatment, three resources were used. A 
list of all the pupils for whom scores of The Lorae- 
Thorndike Intellige nce Tests and the Iowa Testa of Basic 

available was compiled and con5)ared first to a 
list supplied to Taylor-"- by Ellen Snow, who was on the staff 



Thomasine H. Taylor, "The Effects of Oral-Aural 
^l^^fffv*^^*^"^"^ °^ English Language for Fourth 

and Fifth Grade Disadvantaged Mexican- American Children, " 
Unpublished Doctoral Dissertation, The University of Texas, 



Table 5 



Niimber of Children in Experimental Classes' and in 
Data Analyses Year One (1964-55), Year Two 
(19€5-66) and Year Three ( 1966-67) , 
San Antonio Language Research Project 

Ni3inl>er of Pupils Nianiber of Pupils 
Treatment Grade in in 
Exper, Classes Data Analyses 

Santple I 
(Entered first 
grade 1964) 



LCE 


1 


250 


186 




2' 


300 


178 




3 


200 


120-1602 


LCS 


1 


250 


204 




2 


250 


178 ^ 




3 


190 


114-1522 


NOA 


1 


250 


194 




2 


275 


177 




3 


250 


150-2002 



Sanple II 
(Entered first 
grade 1965) 



LCE 


1 


300 


187 




2 


304 


182-2432 


LCS 


1 


175 


105 




2 


202 


121-1612 


NOA 


1 


275 


16Q - 




2 


255 


153-2042 



■^Con5)iled from Thomas D. Horn, "A Study of the 
Effects of Intensive Oral- Aural English Language Instruc- 
tion, Oral-Aural Spanish Language Instruction and Nbn-Oral- 
Aural Instruction on Reading Readiness in Grade One," The 
University of Texas, Austin, 1966, p. 18; Richard D. 
Arnold, "Capsule Description of San Antonio Language- 
Bilingual Research Project," The University of Texas, 
Austin, 1967 (Mimeographed); Richard D. Arnold, "1965-66 
(Year Two) Findings, San Antonio Language Research Project, 
Thomas D. Hbm, Director," The University of Texas, Austin, 
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of the San Antonio Bilingual Research and Dissemination 
Center. This list contained the names of all children who 
had received consistent treatment and had not failed a 
grade through Year Five (1968-69). This left unchecked 
the students who received consistent treatment only through 
Year Four (1967-68). Therefore, two other resources were 
used: treatment histories prepared by the teachers of 
experimental classes in Year Four (1967-68) and the Perma- 
nent Cumulative Records for each pupil. This process 
resulted in the sample sizes shown on Table 6 . 

The children who served as controls were pupils in 
Project schools who had never been in experimental classes 
and whose scores were made available to this researcher. 
The controls received the regular San Antonio curriculvun 
so they will be referred to as the SAC group. 

Examination of the ages of the children in the SAC 
group showed that at least one of the twenty-one fourth 
grade pupils was overage for his grade and at least eight 
of the sixty-eight third grade pupils were overage. Thus, 



1968, pp. 6 and 8j and Lester N. Knight, "1966-67 (Year 
Three) Findings: A Comparison of the Effectiveness of 
Intensive Oral-Aural English Instruction, Intensive Oral- 
Aural Spanish Instruction, and Non-Oral- Aural Instruction 
on the Oral Language and Reading Achievement of Spanish- 
Speaking Second and Third Graders," The University of 
Texas, Austin, 1969, pp. 106 and 127. 

2 

Estimates derived from Knight's statement that 
sixty to eighty percent of the scores survived his matching 
technique. Knight, og. cit., p. 127. 
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Table 6 

N\mber of Children in Research Sample 
Year Four (1967-68), San Antonio 



Treatment Group^ 


Grade Four 


Grade Three 


1 ?E 


29 


109 


NOA-LCE 


15^ 


59 


LCS 


60 


40 


NOA-LCS 


17^ 


1^ 


, SAC 


21 


68 



LCE = Language-Cognition English; NOA-LCE = Non- 
Oral-Aural until Year Four and then Language-Cognition 
English; LCS = Language-Cognition Spanish; NOA-LCS = Nbn- 
Oral-Aural until Year Four and then Language-Cognition 
Spanish; and SAC = San Antonio Curricul\im (controls). 



^Data for these children were not analyzed. 
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it is possible that some of the SAC pupils had failed at 
least one grade whereas scores for the few children who had 
been failed in the experimental groups were eliminated from 
the data analyses. 

Scores in three cells on Table 6 wore not used in 
the data analysis. The cell with one student, Grade Three 
NOA-LCS, is obviously inappropriate for further analysis. 
Tlrie cells with fifteen and seventeen pupils. Grade Four 
NOA-IX:e and NOA-LCS, were dropped because of small size and 
because all the children in each cell had the same teacher 
in Year Pour (1967-68). Moreover, the seventeen children 
in the NOA-LCS group had a first-year teacher. 

In Year Four (1967-68) the students in each treat- 
ment came from several classes in several schools. This 
fact may help to moderate the teacher variable effect. 
Table 7 reveals the breakdown by school and teacher. 

Instrumentation 
The Iowa Tests of Basic Skills were chosen as the 
major instnament for the study since they are intended to 
measure generalized intellectual skills and abilities in 
third to ninth graders. They are not intended to measure the 
"acquisition of specific information in special subjects."^ 



•^E.F. Lindquist and A.N. Hieronymus, Manual for 
Administrators. Supervisors, and Counselors: Iowa Tests 
of Basic Skills. Houghton Mifflin Co., Boston, 1964, p. 4, 
hereafter referred to as Manual for Administrators. 
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Table 7 



Niimber of Teachers 
Sample 1 Year Four 


and Schools 
(1967-68), 


in Research 
San Antonio 




School 


Number 


of Teachers 




Grade Four 




2 


3 


Crockett 




2 




Carvajal 


1 


1 




Johnson 






- 


Ogden 


1 




3 


Ruiz 








Brackenridge 


1 


2 




Grade Three 








Crockett 


1 


2^ 


3 


Carvajal 


2 


1 




Johnson 


2 


1 




Ogden 


2 




2 


Ruiz 


1 




2 


Brackenridge 


4 


1 




Brewer 


1 
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^IX;E = Language-Cognition English 
2 

ICS = Language-Cognition Spanish 

3 

SAC = San Antonio Curriculum 

^These two teachers taught one class, one each 
semester. 



The question remained as to what type of statistical analy- 
sis vwuld be most appropriate and useful. 

Scores for the Iowa Tests of Basic Skills were 
available from both fall and spring administrations. There- 
fore, one possible analysis would have been to conpare the 
mean gain scores of the different groups in order to see if 
the difference between the fall and spring scores is any 
greater for one group than any of the others. This alterna- 
tive was eliminated since the main interest was in what had 
happened to the children as a result of four or three years 
of experimental treatment rather than in what happened 
during Year Four (1967-68) only. 

Another possibility would have been to conpare the 
spring scores using analysis of covariance with the fall 
scores as the covariable. This means that the spring 
scores would be compared only after the groups had been 
equated statistically in terms of fall scores. This choice 
had at least two disadvantages. One, again this would 
result in looking only at what happened during one school 
year. Two, a child's ability or achievement level in the 
fall is due to many factors, among them: IQ, past teachers, 
other learning experiences in and out of school, and his 
cwn diligence. It would be hard to say in exactly what 
terms the groups were being ecpiated. 

Many of the children's test scores from previous 
years in the Project were theoretically available so that 



these scores might be considered covariables in a compari- 
son of the spring scores of the Iowa Tests of Basic Skills , 
The actual degree of availability was very low, however. 
Pvmch cards containing test data from Year One (1964-65) 
existed but the location of the decoding sheet at the time 
this ^tudy was begun was Tonknown. All the tests from 
Years Two (1965-66) and Three (1966-67) were graded and in 
storage. Unfortunately, disorder in the files, along with 
the low probability that there would be conplete data on 
any one child, discouraged anyone from investing the great 
amoxint of time necessary to do anything with the test scores 
accxamulated during the length of the Project. These diffi- 
culties ruled out any analysis using previous years" scores 
as covariables as well as any analysis of trends in test 
scores of the groups over tKe years. 

With the previously mentioned alternatives elimi- 
nated, the only reasonable analysis left seemed to be to 
compare the mean scores on the spring administration of 
the Iowa Te sts of Basic Skills . Obviously, a one way 
analysis of variance could be performed^ This involves 
"testing of the significance of the over-all variation of 
the means for several groups, the groups differing on the 
basis of a single classificatory principle*"^ In this 



-"■Quinn McNemar, Psychological Statistics . 3rd ed. , 
John Wiley and Sons, Inc., New York, 1962, p* 288. 
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case, the single classif icatory principle is the treatment 
group: intensive instruction in English or Spanish or 
regular San Antonio curriculum, in all other respects the 
groups would be assiomed to be equal. 

Thus, in order to obtain any meaningful conclusions 
from analysis of variance, one must know whether or not 
the experimental groups were similar at the beginning of 
the study, in the case of the San Antonio Language Research 
Project, no one knows for certain whether there were sig- 
nificant differences in educational factors between the 
groups at the beginning of Year One (1964-65). For the 
sake of argument one has to assume that there were no sig- 
nificant differences, although in several experimental 
classes the youngest children were found to have been 
deliberately assigned rather than being randomly selected. 
Analysis of Year One data indicated that the standardized 
instruments available were not appropriate for measuring 
whatever differences might have existed in reading readi- 
ness and general ability in disadvantaged Spanish-speaking 
first grade children. 

The children throughout the San Antonio Independent 
School District had administered to them a group IQ test, 
The lorge-Thorndike Intelligence Tests , in the fall of 
third grade. Thus, by Year Four (1967-68) IQ scores for 
both san^jles (third graders and fourth graders) were avail- 
able. How valid these scores are is questionable, but it 

ERIC 
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is probably safe to say that they are more valid than scores 
of any test administered at the beginning of first grade. 
In addition, whatever factors lead to lack of validity 
probably affect the scores of all the children in this 
research saiiple in about the same way. It was decided to 
perform also an analysis of covariance using IQ as a covari- 
able. Analysis of covariance "is applicable whenever it 
seems desirable to correct a difference on a dependent vari- 
able for a known variable on another variable which for some 
reason could not be controlled by matching or by random 
san5>ling procedures."^ 

In summary, the question to be answered in this 
study was: will there be significant differences among the 
mean scores of the treatment groups on the posttests? The 
Posttest was the Iowa Tests of Basic skills administered in 
the spring of 1968. Both euialysis of variance and analysis 
of covariance were to be performed. The covariable was to 
be IQ measured by The Lorae-Thomdike Intelligence Teats 
administered in the fall of 1967 for third grade children 
and in the fall of 1966 for fourth grade children. • 

The Lorae-Thomdike Intelligence Testa ^ 

This is a group intelligence test, the scores used 



■^McNemar, o^. cit . , p. 363. 



2 

. Irving Lorge and Robert L. Thorndike, Hbughton 
MifflmCo., Boston, 1957. 
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in the Year Pour (1967-68) analysis are from Level 2, Form 
A, designed for grades two and three and consisting only of 
a non-verbal battery. This test is extensively reviewed 
in The Fifth Mental Measurements Yearbook .^ For that 
reason, comments here will be brief. 

According to Freeman, the Lorge-Thorndike series 
are on the whole among the sounder group instniments avail- 
able. Pidgeon claimed that although one may doubt the 
value of the non-verbal tests for predicting ability to 
read or any scholasT:ic performance, the test does ade- 
quately serve the purpose of separating children into rela- 
tively homogeneous ability groups.^ He warned that no 
assujiptions should be made about the test nieasuring mental 
capacity, but "the tests provide reliable meastires of 
verbal reasoning and nonverbal reasoning."^ 

In this study the assumption is not made that the 
Lorge-Th orndike tests measure or predict innate ability to 
learn. Probably the test scores reflect differences in 
cultural background, in what has already been learned, in 

lOscar K. Euros, ed. , The Fifth Mental Measurements 
Yearbook, The Gryphon Press, Highland Park, N.J. , 195^, pp. 
47:7-**8**. 

^Frank S. Freeman, The Fifth Mental Measurements 
Yearbook , p. 481. " 

D.A. Pidgeon, The Fifth Mental Measurements Year- 
book, p. 483. — ^ — - 

^Ibid . , pp. 483-84. 



the ability to take tests aiid to understand classroom lan- 
guage, arnong other things. However, learned and innate 
mental capacities no doubt at least partially underly these 
abilities. The statistical measures and analyses were 
chosen to facilitate the accounting for differences in 
mental capacity of the groups which might affect reading 
and the other criterion skills measured by the post tests. 
In order to avoid any possible confoianding of the results, 
an analysis of variance and an analysis of covariance using 
IQ as the covariable are reported. 

Iowa Teals of Basic Skills "^ 

« 

/ This is a group test measuring skills in vocabulary, 
reading, language, work-study and arithmetic. The scores 
used in the Year Pour (1967-68) analysis are from Form 4 
for Grade Three and from Form 3 for Grade Four. 

The battery actually consists of one long test for 
each area or sub-area, but each grade takes only a part 
of each test. The part-.s assigned to each grade level par- 
tially overlap each other. 

Vocabularv : ^ Each item contains the stimulus word 
in the context of a phrase or sentence. The pupils must 



E.F. Lindquist and A.N. Hieronymus, Houghton 
Mifflin Co., Boston, 1964. 

2 

The following descriptions of each test are 
adapted from Manual for Administrators , pp. 27-37. 
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mark the correct one of four definitions. Each definition 
is a word or a phrase. 

Reading Comprehension; The pupils read several 
selections that vary in length up to half a page. Then 
the pupils answer questions based on the selections by 
marking one of four responses. 

The passages were chosen in an atteirpt to repre- 
sent as completely as possible all of the types 
of material encoiintered by the pupil in his 
everyday reading. They ware adapted from a wide 
vaij-ety of sources: newspapers, magazines, en- 
cyclopedias, government publications, textbooks 
and original literary works. 

Some of the skills measured in the Reading Corrprehension 
test include: 

(1) Recognizing and understanding stated or 
implied factual details and relationships. 

(2) Discerning the purpose or main idea of a 
paragraph or selection. 

(3) Organizing ideas. 

(4) Evaluating what is read. 

Language Skills; These tests consist of four parts: 
Spelling, Capitalization, Pxuictuation and Usage. On the 
spelling test, each item has four words listed. One or 
none of the words is misspelled. The pupil mcirks either 
the incorrectly spelled word or a "No mistakes" response. 



^Manual for Administrators , p. 27. 
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"The tests were constructed to sample systematically a wide 
variety of error types" such as final e or e before suffix, 
interchanged letters, and t and ed substitutions.^ 

In both the Capitalization and Punctuation tests 
each item has one or two sentences printed in three lines 
of about equal length. The pupil marks the line with an 
error or a fourth response labeled "No mistakes." 

Each item in the Usage test contains three sentences. 
One or none of the sentences has a usage error in it. The 
pupil marks the sentence with the error or a fourth response 
labeled "No mistakes." The errors are such things as use 
of pronouns and verbs and avoidance of double negatives. 

Work-Study Skill.... These tests cover: (1) knowl- 
edge and use of map materials; (2) knowledge and use of 
graphic and tabular materials; and (3) knowledge and use of 
reference materials, m the Map Reading test each map is 
followed by several questions. The pupil must pick the 
correct answer from four responses. Various types of maps 
are included. The skills include such things as deter- 
mining distance and describing location. 

The test entitled Reading Graphs and Tables pre- 
sents several types of graphs and tables. A series of 
questions requires the pupils to display such skills as 



^ Ibid . . p. 32. 
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reading facts from the graphs or tables, making coitparisons 
and interpreting. The pupil selects the correct response 
from four possibilities. 

In Knowledge and Use of Reference Materials the 
pupil again chooses the correct answer from four possibili- 
ties. The topics include using word lists, using general 
reference materials, alphabetizing words, using an index, 
using a table of contents e'.nd using an encyclopedia. 

Arithmetic Skills: These tests include items on 
(1) arithmetic concepts and (2) arithmetic problem solving. 
In both tests each item consists of a question, m Arith- 
metic Concepts the child has to pick one of four answers. 
In Arithmetic Problem Solving the child has to solve the 
problem stated in the item. The four responses he has to 
choose from include three possible answers and a response 
labeled "Not given." The skills in the arithmetic testa 
cover concepts involving currency, decimals, equations, 
fractions, geometry, measurement, numerals and number sys- 
tems, percent s, ratios and proportions, and whole numbers. 
The number of items for each test at each grade 
, and the time limits allowed for each test are given 
below :^ 



•' •Ibid . . p. 5. 
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Number of Items Nuiriber of Items Time in 
Test Grade Three Grade Four Minutes 

Vocabulary 31 38 17 

Reading 

Comprehension 60 68 55 

Spelling 31 38 12 

Capitalization 38 39 15 

Punctuation 38 39 20 

Usage 32 32 20 

Map Reading 27 32 30 

Reading Graphs 20 24 20 

and Tables 

Knowledge and Use 42 52 30 

of Reference 

Materials 

Arithmetic 30 36 30 

Concepts 

Arithmetic Problem 25 27 30 

Solving 

With regard to the norms used in this study, the 
publishers of the Iowa Tests of Basic Slcills said that 
the sairpling units for standardization purposes were 
selected primarily on the basis of community size and socio- 
economic characteri sties No mention was made of the 
languages or dialects spoken by the children in the stan- 
dardization sanple. 



^Ibid , . p. 51. 
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The following conclusions from the reviews in The 
Fifth M ental Measurements Yearbook are especially relevant 
for this study: 

' (1) "The re^aewer would strongly recommend the 

tests as a whole as the oest of their kind. 

(2) Intercorrelation coefficients and examination 
of test items reveal that the skills most heavily tested 
are vocabulary and reading, 

(3) "E^qperience suggests that, together with 
reading coinprehension and vocabulary, the letnguage tests 

should form a composite yielding a reasonable measure of 

2 

language attainment." 

(4) Split-half reliability coefficients are quite 
satisfactory especially for the major test and composite 
scores. 

Testing Procedures 

The Lorqe-Thorndike Intelligence Tests were given 
to the grade four children in the fall of 1966 and to the 
grade three children in the fall of 1967. The tests were 
administered and scored by the classroom teachers. 

The Iowa Tests of Basic Skills were administered 
by the teachers co both grades in April, 1968. The tests 



G.A.V. Morgan, The Fifth Mental Measurements 
Yearbook , p. 36. 

^ Ibid . . p. 35. 
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were electronically scored by the Houghton Mifflin Scoring 
Service in Iowa City, Iowa* ' 

Data Analysis Procedures 
The first run through the CDC 6600 Conputer at The 
University of Texas at Austin Computation Center produced 
means and standard deviations for each experimental group 
in each grade on each of the tests: one set for The 
Lorge-Thorndike Intelligence Tests and fifteen for the 
Iowa Tests of Basic Skills > Jennings' routine LINEAR, a 
statistical computer program from the Edstat-J Library at 
the Computation Center, was used for analysis of variance 
and covciriance on the second mn throvigh the coitputer.^ 

RESULTS OF DATA ANALYSES 

Descriptive Statistics 
The means emd standard deviations for Tha Lorge- 
Thorndi ke Intelliaence Tests and the Iowa Tests of Basic 
Skills are reported in Table 8 and Table 9. 

The scores of the Iowa Tests of Basic Skills are 
the grade equivalents of raw scores. "The grade equiva- 
lent of a given raw score on any test indicates the 
grade level at which the typical pupil makes this raw 

^* Jennings, A Subroutine Svstem for Data Pro^ ^ 
cessing in the Behavioral Sciences . The University of 
Texas, Austin, 1968. 
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score." For scores between 3.1 and 8.1, the first digit 
represents the grade and the second digit the end of the 
month in whicn the typical student makes that particular 
raw score. If the second digit is a zero, the score cor- 
responds to the raw score of the typical pupil at the 
beginning of the grade. The publisher warns that grade 
equivalents below 3.1 "may not be interpreted literally 
as indicating grade and month" because they were not ob- 
tained from actual distribution of scores.^ 

Rather, they were derived statistically on the 
assuiiption that pupils grow in the skills 
measured at the same rate in the first two 
grades ... as in the third to eighth grades. 
This assuit5)tion is not strictly valid. ^ 

Comparing Treat ment Groups bv Sxibtest 

Fourth Grade; Although the SAC group mean score 
was highest on the IQ measure,- the LCE 'group scored 
highest on 13 of 15 possibilities on the Iowa Tests of 
Basic Skills (ITBS). The LCS group IQ was nearly the 
same as that of the LCE group, but the LCS means on the 
ITBS were lowest on 14 of 15 possibilities. This general- 
ization thus accurately describes the order of the mean 



Manual for Administrators , p. 13. 
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scores of the three treatment groups on the ITBS : LCE ^ 
SAC^, ix:s. Results of analysis of variance used to deter- 
ndne which of the differences are statistically significant 
are described in the next section. 

Third Grade; The SAC group attained the highest 
mean scores on the IQ measure and on every test of the Iowa 
Tests of Basic Skills . The ICS group received the lowest 
mean score on 14 of 15 possibilities on the ITBS. The LCE 
and NOA-I/:e means were in between, with neither consistently 
superior to the other. This generalization describes the 
order of the mean scores of the four treatment groups on 
the ITBS: SAC 7 ICE = NOA-LCE > LCS. Results of analysis 
of variance used to determine which of the differences are 
statistically significant are described in the next section. 

Compari ng Subtest Scores of Treatment Groups 

To conpare the relative standing of aiiy group on 
the different tests of the Iowa Tests of Basic STcills one 
cannot use the grade equivalent scores. The amovint of 
variation in scores is not the same on every test. One must 
use percentile rankings. The procedure utilized was to 
change the grade equivalents to percentiles by using 
tables giving end-of-year norms for school averages.^ 



Manual for Admini strators , pp. 73-74. 
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The children in the treatment groups represented several 
schools but for the purpose of coniparing the scores in the 
several areas, the assxamption was made that the children 
in each group attended the same school. 

Beginning-of-year norms were based on results 
obtained in a standardization program in which the Iowa 
Tests of Basic Skills were administered in approximately 
350 school buildings in grades three through six. Mid- 
year and end-of-year norms were then established by inter- 
polation. ^ 

Table 10 and Table 11 provide the percentile rankings 
for Grade Four and Grade Three. If a group had a percentile 
ranking of 7, for instance, this means that that group, con- 
sidered as a school, scored higher than seven percent of all 
other schools, at that grade level, and for that test. 

Fourth Grade; For the LCE group, the Language total 
score and subtest scores were the strongest scores (15 to 
39). The Work-Study Skills were second strongest. Arith- 
metic was next while Reading (10) and Vocabulary (5) were 
the lowest. 

For the ICS group the percentiles were all almost 
the same, ranging between four and fifteen. The Language 
Skills percentiles were the highest. 
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Table 10 

Percentile Ranks of Grade- Equivalent Averages on 
Iowa Tests of Basic Skills for Grade Four 
in Year Four (1967-68), San Antonio^ 



Test 


ICE?- 


LCS^ 


SAC* 


Vocabulary 


5 


4 


7 


Reading 


10 


5 


10 


Spelling 


38 


9 


14 


Capitalization 


15 


9 


11 


Punctuation 


33 


10 


15 


Usage 


39 


15 


15 


Total Language 


28 


11 


13 


Maps 


22 


7 


12 


Graphs 


22 


7 


15 


References 


17 


7 


13 


Total Work-Study 


20 


7 


13 


Arithmetic Concepts 


15 


4 


10 


Arithmetic Problems 


21 


5 


8 


Total Arithmetic 


16 


5 


8 


Composite 


14 


6 


10 



^Taken from End-Of-Year (April 15) Nbrms for School 
Averages, Grade 4, Manual for Administrators , p. 74. 

2ix:e = Language-Cognition English 

^LCS = Language-Cognition Spanish 

^SAC = San Antonio Curriculum 
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Table 11 

Percentile Rauiks of Grade-Equivalent Averages on 
Iowa Tests of Basic Skills for Grade Thr i --, 



in Year Four (1967-68), 


San Antonio^ 




Test 


LCe2 


NOA-LCE^ 


LCS^ 


SAC^ 


Vocabulary 


8 


10 


5 


21 


Reading 


8 


8 


3 


20 


Spelling 


14 


19 


8 


44 


Capitalization 


12 


10 


4 


50 


Punctuation 


19 


15 


7 


62 


Usage 


15 


7 


5 


50 


Total Language 


13 


11 


5 


40 


Maps 


6 


6 


9 


32 


Graphs 


9 


13 


5 


27 


References 


11 


11 


6 


24 


Total Work-Study 


8 


8 


6 


30 


Arithmetic Concepts 


14 


10 


8 


37 


Arithmetic Problems 


22 


22 


7 


44 


Total Arithmetic 


15 


15 


6 


39 


Composite 


10 


10 


5 


30 



. ^aken from End-Of-Year (April 15) Nbnns for School 
Averages, Grade 3, Manual for Administrators , p. 73. 

2 

'LCE = Language-Cognition English 

^NOA-LCE = Non-Oral-Aural until Year Four and then 
Language-Cognition English 

^LCS = Language-Cognition Spanish 
^SAC » San Antonio Curriculum 
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The pattern for the SAC group was similar to that 
of the LCE group. Language Sld.lls had the highest per- 
centiles and Work-Study Skills were second highest. Again 
Vocabulary and Reading Coitprehension were lowest. 

Considering axl the groups together, the Lcinguage 
Slcills received the highest percentiles and Reading Compre- 
hension and Vocabulary the lowest. As a matter of fact, 
for all three groups no percentile rank for any subtest or 
total score was lower than Vocabulary. 

Third Grade: The overall pattern for third grade 
was similar to the fou-»-tli grade in that Vocabulary and 
Reading Ccitprehension were again the weakest relatively. 
The Arithmetic Skills total percentile ranking was highest 
for ICE, NOA-LCE and LCS and second highest for the control 
(SAC) group. The second highest rankings for LCE and NOA- 
LCE were in Language Skills. As with the fourth grade, the 
LCS percentile rankings were all very close together (3rd 
to 9th percentiles ) . 

In both grades, the highest percentile for an ex- 
perimental treatment group was 39, for the fourth grade 
IX:e group in Usage. Most of the percentile rankings were 
xmich lower. 




Inferential Statistics 
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Fourth Grade 

The major null hypothesis for fourth grade stated 
that after four continuous years the LCE, LCS and Control 
(SAC) groups will not differ significantly on fifteen sxab- 
test and tota2. scores from the Iowa Tests of Basic Skills , 
The 3tatistical findings for analysis of variance and analy- 
sis of covariance are reported on Table 8, 

I^: Analysis of variance revealed no significant 
differences in non-verbal IQ as measured by The Lorae- 
Thorndike Intelligence Tests . The mean IQ's were: SAC — 
91.62; LCE~88.72 and LCS~87.02. 

Vocabulary ; The results of analysis of variance 
revealed that there was no significant difference in Vocab- 
ulary scores among the treatment groups. Analysis of covari- 
ance was not appropriate because the ass\OTption that the 
regression lines be parallel was not met. Figure 1 provides 
a pictorial display depicting the non-parallel nature of the 
regression lines. It appears that at IQ"s above 90, the 
regular San Antonio curriculxan was more effective while 
below 90, the IX2E and ix:s treatments were more effective. 

The experimental oral language programs apparently 
did not result in differential growth in vocabulary as 
measured by the Iowa yests of Basic Skills . Since the tests 

ERLC 




Pigur« X. Leaat- squares Regression Lines for Each of the 
Three Groups (GradQ Pour, Year Four, X967-68, San Antonio) 
Indicating the Relationship Between The Lorae^Thorndike • 
Intelligence Tosta Scores and the Iowa Tests of BaaiTTk ^ lls 
(Vocabulary subscale) Grade Equivalence Scores* These lines 
Kirere extended approximately two standard devjLetione £ron the 
Oraad Mean IQ* 
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were administered in April, the typical child would have 
attained a score of 4.8 (fourth grade, eighth month). The 
mean scores for all three groups were more than a year below 
norm: SAC — 3.5 (third grade, fifth month); LCE — 3-4 and 
LCS— 3.3. 

Reading Contprehension : The results of analysis o;. 
variance showed that there was no significant difference in 
Reading Conprehension scores among the treatiment groups. 
Analysis of covariance using IQ scores as the co variable 
also revealed no significant differences. The null hypothe- 
sis was accepted. The mean scores were: LCE — 3.8; SAC — 3.8 
and LCS — 3.5. ^ 

According to the results from this test, four years 
of intensive language training using the methods developed 
for the San Antonio Language Research Project did not produce 
differential growth in reading ability as measured by the 
Iowa Tests of Basic Skills . 

Language Skills— Spelling: In Spelling, significant 
differences were found favoring the LCE group (p = .004). 
The mean score of the LCE group was 4.6. The LCS group mean 
was almost an entire year lc./er, 3.7. The control group 
(SAC) mean was 4.0. By using t-tests to coirpare each pair 
of means, it was found that the scores of the LCE and SAC 
groups were not significantly different, nor were those of 
the SAC and LCS groups. However, the LCE and LCS mean 
scores were significantly different at the .001 level. 
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Analysis of covariance was not appropriate because 
the regression lines were not parallel. Pigxire 2 provides 
a pictorial display showing the non-parallel nature of the 
regression lines. The LCE group performed much better than 
the ICS group at all IQ levels. Above IQ of 100, the SAC 
group scores were highest: between about 90 and 100, they 
were between the lce and ICS scores: below 90, the SAC treat- 
ment was not as effective as the experimental language treat- 
ments. 

On the basis of analysis of variance only, the null 
hypothesis was rejected. The results of this test indicate 
that the Ix:e treatment had a beneficial effect on development 
of spelling ability as measured by the Iowa Tests of Basic 
Skills . 

Lancniaa e Skills — Capitalization ; According to both 
analysis of variance eind analysis of covariance urjing IQ as 
the covariable, there was no significant difference in scores 
on the Capitalization subtest among the treatment groups. 
Therefore the null hypothesis was accepted. The mean scores 
were: LCE~4.07 SAC— 3.8 and LCS— 3.7. 

If one of the groups receiving special lemguage 
instruction had scored significantly higher or lower than the 
control group, it would have been difficult to supply an 
explanation. There was nothing in the experimental curriculum 
directed specifically at developing ability to capitalize 
correctly. 
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Figura 2* Least^-squarea Regraasion Linaa for Bach of th*» 
Thraa Giro^ (Grade Fdur« Year Four, 1967-*68, San Antonio) 
Indicating the Relationahip Betweon The Loraa^^Hiomdika • 
'ntellig^nce Testa Scorea and the Iowa Te^ta of Baaic Skilla 
4p^inO •uhacala) Orada Bquivalence Scorea* ^tkeee llnee 
mre es^^tended - pproximately tuo ataadard 40ViatloM twm the 
" 10* 
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Language Skills — Pvmctuatio n: Analysis of variamce 
and analysis of covaricince using IQ as the covariable re- 
vealed no significant difference i-i scores on the Punctuation 
subtest among the treatment groups. Therefore the null hy- 
pothesis was accepted. The mean scores were: LCE — 4.5; 
SAC — 4.0 and ICS — 3.8. Again, this result was eacpected. 
The language lessons had no content directed specifically at 
punctuation skills. 

Languag e Skills — Usaae : According to analysis of 
variance there was no significant difference in scores on the 
Usage subtest among the treatment groups. Thus, on the basis 
of analysis of variance, the null hypothesis was accepted. 
The mean scores were: LCE — 4.5j SAC — 3.8 and LCS — 3.8. 

According to analysis of covariance with IQ as the 
covariable, however, there was a significant difference 
(p = .044). Tierefore, on the basis of analysis of covariance, 
the null hypothesis was rejected. Figure 3 graphically por- 
trays the results of e/.alysis of covariance. When scores 
weie adjusted foi IQ, the LCE treatment group scores remained 
highest at *ii.l IQ levels, but the LCS scores became second 
highest and the SAC scores were lowest. The order of these 
scores supports the hypothesis that the LCE trjatment «rd 
perhaps the LCS treatment had beneficial effects on growth 
in langiiage usage as measured by a %«:itten test, the lowa 
Tests of Basic Ski lis . 
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Lanqi. age Skills — Total ; Analysis of variance re- 
vealed no significant differences in Language Skills — Total 
mean scores among the treatment groups, although the proba- 
bili:::y level did approac>i significance (p = .052). On the 
basis of analysis of variance, the null hypothesis was 
accepted. The roean scores were: LCE — 4.4: SAC — 3.9 and 
LCS— 3.S. 

When considering IQ as a covariable ana using analy- 
sis of covariance, a significant difference was found among 
the mean scores of the Language Skills — Total siibscale 
(p = .045). Thus, the null hypothesis was rejected. Figure 
4 presents graphically the results of analysis of covariance 
At all IQ levels, the LCE adjusted scores were almost six 
months higher than the LCS and SAC scores. The LCS and SAC 
scores were almost the same. This finding indicates that 
the LCE treatment had a beneficial effect on performance in 
the language area as measured by a written test, the Iowa 
Tests of Basic Skills . 

Work-St udy Skills — Map Reading : According to both 
analysis of variance and covariance, using IQ as the covari- 
able, there was no significant difference in the scores on 
the Map Reading siabtest among the three treatment groups. 
Tl^e nail hypothesis was accepted. The mean scores were: 
LCE— 4.2; SAC~3.9 and LCS— 3.7. This result was not 
entirely expected since the fourth grade language Ic^ssons 
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Figure 4. Least-squares Regression Lines for Each of the 
Three Groups (Grade Four, Year Four, 1967-68, San Antonio) 
indicating tne Relationship Between The Lorae-Tho rndik^ 
IntclligoncP Tf>sr^ Scores and the Iowa Tests of Bas:Lc Skill s 
Oanguage Skills—Total Subscale) Grade Equivalence Scores, 
rhese lines were extended approximately two staixiard devia- 
tions from tiie Grand Mean IQ. ~««i^u oevia 
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included activities involving construction and reading of 
maps. Unless the regular San Antonio curriculiim included 
many of these kinds of activities the LCE and LCS should 
have shown greater strengths in this area. What was in- 
cluded in the regular San Antonio curriculum is not known. 

Worv-Study Skx lis — Reading Graphs and Tables : Both 
analysis of variance and of covariance revealed no signifi- 
cant differences in the mean scores of the three treatment 
groups on the subtest Reading Graphs and Tables. The null 

hypothesis was accepted. The mean scores were: LCE 4.2: 

SAC— 4.0 and LCS — 3.7. 

Some of the AAAS activities in the LCE and LCS treat- 
ments involved the use of graphs and tables. Therefore it 
was expected that the LCE and LCS groups would have shown 
relative strengths or this subtest. Thus the results of 
analysis of varj^aijctf^ and Cv^variance were not those escpected. 

Work-Studv Skills— K nowledge and T?.c. .. of Reference 
Materials : Analysis of variance and analysis of covariance 
revealed no significant differences in the mean scores of the 
treatment groups. The null hypothesis was accepted. The 
mean scores were: LCE— 4.1; SAC— 4.0 and LCS— 3.7. 

Work-Studv Skillfl.-TQtal . According to analysis of 
variance there was a significant difference in Total scores 
of the Wbrk-Study Skills tests (p = .048). On the basis of 
analysis of variance, the null hypothesis was rejected. The 
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mean scores were: LCE— 4.2; SAC— 4.0 and LCS— 3.7. Ac- 
cording to t-tests, neither the differences between the mean 
scores of the lce and SAC groups nor between the SAC and LCS 
groups were significant. The difference between the LCE and 
LCS mean scores was significant at the .05 level. 

According to analysis of covariance, however, there 
was no significant difference among the means, mus on the 
basis of analysis of covariance the null hypothesis was 
accepted. 

Since the AAAS activities involved use of both maps 
and graphs, it was expected that the LCE and LCS groups would 
have performed significantly better than the SAC group on the 
Maps and Graphs subtests and on the Work-Study Skills— Total. 
This differential growth was revealed for the LCE group only 
with analysis of variance as applied to the scores of Work- 
Study Skills — Total. 

Arithmetic Ski 1 1s — Arithmetic Concepts : The results 
of analysis of variance showed that there was a significant 
difference in the loean scores on the Arithmetic Concepts sub- 
test (p = .046). Therefore, on the basis of analysis of 
variance, the rull hypothesis was rejected. The mean scores 
were: LCE— 4.1? SAC— 3.9 and LCS— 3.6. T-tests found sig- 
nificant differences only between the LCE and LCS mean scores 
(p = .05). 

The findings favoring the LCE group according to 
analysis of variance may reflect the fact that some of the 
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oral language lessons included topics in arithmetic or arith- 
metic related concepts, e.g., shorter than , longer than , and 
thicker than . Results of analysis of covariance revealed no 
significant differences, but the probability level did ap- 
proach significance (p = .057). 

Arithmetic Skills — Arithmetic Problem Solving : Ac- 
cording to both analysis of variance and analysis of covari- 
ance, there were no significant differences in the scores of 
the three treatment groups on the Arithmetic Problem Solvi.-.g 
subtest. The significant difference favoring the lce group 
found through analysis of variance in Arithmetic Concepts 
was not found in the scores on the Arithmetic Problem ^.^Iving 
subtest. The null hypothesis was accepted. The mean scores 
were: LCE~4.2t SAC~3.8 and LCS— 3.6. 

Arithmetic Skills— TotR 3 r On Arithmetic Skills — 
Total, significant differences were found favoring the LCE 
group using analysis of variance (p = .046). The null hy- 
pothesis was rejected. The mean scores were: LCE— 4.1; 
SAC— 3.8 and LCS— 3.6. According to t-t-.ests, the difference 
between the means of the LCE and LCS groups was significant 
at the .05 level, but the differences between the means of 
the LCE and SAC groups and between the SAC and LCS groups 
were not significant. When analysis of covariance was used, 
no significant differences were found in Arithmetic Skills- 
Total, and the null hypothesis was accepted. 



Composite : Analysis of variance revealed no signifi- 
cant differences among the treatment groups on Composite mean 
scores. On the basis of analysis of variance, the null hy- 
pothesis was accepted. Mean scores were: LCE— 4.0; SAC— 
3.8 and LCS— 3.6. Analysis of covariance was inappropriate ' 
t .^cause the assumption thai: the regression lines be parallel 
was not met. Figure 5 depicts the non-parallel nature of 
the regression lines. The Ix:e scores were superior to the 
ICS scores at all IQ levels. At IQ's above about ^5, the SAC 
^scores were superior to both the special language treatments. 
Thus, according to analysis of variance only, the 
LCE and LCS tijeatir^nts did not produce differential growth 
in vocabulary, reading, language, work-study and arithmetic 
skills as summarized by a composite mean score on the lowa 
Tests of Basic Ski 1 1 s , 

In summary, according to analysis of variance, there 
were significant difference., among the three treatment groups 
only for Spelling (p = .004), Work-Study Skills—Total (p - 
.048), Arithmetic Cc.cepts (p = .046), and Arithmetic Skills- 
Total (p = .046). In all cases, the LCE group had the highest 
mean and the u:s group the lowest. The following generaliza- 
tion, based on analysis of variance, describes the differences, 
limited to those identified above as significant, between the 
groups: LCE:^SAC>Ix:s, i.e., the LCE group was superior to 
the SAC group which was superior to the LCS group. 
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Figure 5. Least-squares Regression Lines for Each of the 
Ti*ree Groups (Grade Four, Year Four, 1967-68, San Anto^iio) 
Indicating the Relationship Between The LQrqG--Thorndike 
Intelligonco T«^5ts Scores and the Iowa Tests of D . -^sic Skills 
(CoR^dite Subacale) Grade Equivalence Scores. These lines 
v^re extended approximately two standard deviations from the 
Grand Mean IQ. 
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According to analysis of covariance, there were sig- 
nificant differences among the groups only for Usage (p = 
.044) and Language Skills— Total (p = .045). The adjusted 
means were highest for the LCE group and lowest for the SAC 
group. The LCS adjusted means were much closer to the SAC 
mean scores. Thue , uhis limited generalization is appropri- 
ate: LCE>LCS = SAC, i.e., the LCE group was superior to the 
LCS group which v/as only slightly superior to the SAC group 
for Language Usage and Language Skills — Total. 
, The figures depicting the regression lines for analy- 

sis of covariance for the Vocabulary, Spelling and Composite 
subscales seem to indicate,_that the LCE and LCS treatments 
were more effective for children with lower IQ's, while chil- 
dren with higher IQ ' s performed better in the regular San 
Antonio curriculxam. 

Third Grade 

The major null hypothesis for third grade stated that 
after three continuous years the LCE, LCS and NOA-LCE and 
Control (SAC) groups will not differ significantly on fifteen 
subtest and total scores from the Iowa Tests of Basic Skills . 
The statistical findings for analysis of variance and analy- 
sis of covariance are reported in Table 9. 

Ifi: Analysis of variance revealed significant dif- 
ferences in non-verbal IQ as measured by The Lorae-Thorndike 
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Intelligence Tests (p = .001). The Control group (SAC) had 
the highest mean IQ score, 91.96, and the LCE group had the 
next highest, 88.88. The LCS and NOA-LCE groups had almost 
identical mean IQ scores, 82.30 and 82.58, respectively. 
According to r-tests, the differences between the SAC and 
u:e groups and between the NOA-LCE and LCS groups were not 
significant. All other differences (SAC vs. LCS: SAC vs. NOA- 
LCE: I/:e vs. LCS: and LCE vs. NOA-LCE) were significant at the 
.001 level. 

Vocabulary ; The results of analysis of variance re- 
vealed significant differences in Vocabulary scores among 
the treatment groups (p = .001). Therefore the null hypothe- 
sis was rejected. The Control (SAC) group received the 
highest score, 3.2 (third grade, second month). The scores 
for the eaqjerimental groups were all a year or more below 
the scores for the typical pupil at this time of year: 3.8. 
The mean scores were: NOA-LCE — 2.8: LCE — 2.7 and LCS — 2.5, 
T- tests revealed that the difference between the SAC group 
mean and the NOA-LCE mean was significant at the .01 level; 
that the differences between the SAC mean and the LCE and 
LCS means were each significant at the .001 level; that the 
difference between the NOA-LCE mean and LCS mean was signifi- 
cant at the .05 level; and that the differences between the LCS 
mean and both the NOA-LCE and LCE means were non-significant. 
The e>Q)erimental language treatments clearly did not produce 
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differential growth in vocabulary, as measured by the Iowa 
Tests of Basic Skills, and not accounting for differences in 
IQ. 

Analysis of covariance was not appropriate since the 
assun^jtion that regression lines be parallel was not met. 
Figure 6 depicts the non-parallel nature of the regression 
lines. That this test could not be performed was iinfortunate 
since one of the major goals of the language programs was to 
increase vocabulary. The figure showing the regression lines 
reveals that no generalization is possible at IQ's below 
about 100. Above 100, however, the order of the scores from 
highest to lowest is: SAC; NOA-LCE; LCE; and LCS. For the 
higher IQ's, at least, the experimental treatments did not 
have the hoped for beneficial effect on vocabulary develop- 
ment. 

Reading Comprehension; Analysis of variance found 
significant differences in Reading Comprehensioti scores 
among the treatment groups (p = .001). Using analysis of 
variance, the null hypothesis was rejected. The mean scores 
were: SAC~3.3; LCE— 2.9; NOA-LCE~2.9 and LCS— 2.6. T- 
tests showed that the difference between the LCE and NOA-LCE 
means was not significant. Conparisons of all other differ- 
ences between means showed them to be significant at the .05 
level or higher. 
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Figure 6. least-squares Regression Unes for Each of the 

Pour Groups (Grade Three, Year Ptour, 1967-68, San Antonio) 

Indicating the Relationship Between The Loraa-Thomdike 

Intelligence Teats Scores and the Iowa Teats of BaaifTaTcin a 

(Vocabulary Subscale) Grade Bqiuvalence Scores. These lines 

)^Sfl,*S*^ approximately two standard deviations f ro« the 
Grand Mean XQ. 
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When IQ was controlled by using analysis of covari- 
ance, significant differences were found (p = .014). The 
null hypothesis was rejected. Figure 7 graphically portrays 
the results of analysis of covariance. When the scores were 
adjusted for IQ, the highest scores were still obtained by 
the SAC group; the scores of the NOA-LCE group drew ahead of 
those of the LCE group: and the lowest scores were still ob- 
tained by the ICS group. At this grade level, according to 

Iowa Tests of Basic svin«. the hoped for effect on reading 
of the ICE and ICS treatments was not obtained. 

Lancfuaae Skills— Spelling; The results of analysis 
of variance revealed that there was a significant difference 
in the scores on the Spelling subtest of the four treatment 
groups (p = .001). On the basis of analysis of variance, the 
null hypothesis was rejected. The mean scores were: SAC— ' 
3.3j NOA-ICE— 3.3t IX:e— 3.1 and LCS~2.8. According to t- 
tests, the differences between the SAC mean and the mean of 
every other group were significant at least at the .05 level. 
The differences between the NOA-LCE, LCE and LCS groups were 
not significant. 

The findings of analysis of covariance also revealed 
Significant differences (p = .001). Therefore, on the basis 
of analysis of covariance, the null hypothesis was rejected. 
Figure 8 portrays graphically the results of analysis of 
covariance. When scores were adjusted for IQ, the SAC group 
still received the highest scores and the NOA-LCE the second 
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Figure 7. Least-squaree Regression Unes for Each of the 
Four Groups (Grade Three. Year Four, 1967-68, San Antonio) 
indicating the Relationship Between The Lorae- Thorndike 
Jntoiiicfonco Ti-dt.s Scores and the Iowa Tests of Baaic~kill a 
(Reading Comprehension Subscale) Grade Equivalence Scores. — 

•^^•[1^ proximately tuo standard devia- 
tions zxom the Grand Mean IQ. 
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Figure 8. Least-squares Regression Lines for Each of the 
Pour Groups (Grade Three, Year Four, 1967-68, San Antonio) 
Inoxcating the Relationship Between The Lorge-Thorndike 
IntellirLcnce Tests Scores and the Iowa Tests of Basic Skills 
(Spelling Subscale) Grade Equivalence Scores. These lines 
mre extended approximately two standard deviations from 
the Grand Mean IQ. 
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highest; the Ix:e and LCS received almost equal scores. The 
oral language programs in English and Spanish did not pro- 
duce beneficial effects on spelling ability as measured by 
the Iowa Tests of Basic Skills . 

Lanjuaa e Skills — Capitalization: According to analy- 
sis of variance, there were significant differences in the 
scores on the Capitalization subtest among the four treatment 
groups (p = .001). Again the SAC group scored the highest 
mean, 3.8, and the LCS group the lowest, 2.6. The difference 
between the highest and lowest mean was more than an entire 
year in grade equivalents. The LCE and NOA-LCE groups re- 
ceived almost the same score, 3.0 and 2.9, respectively. T- 
tests showed that this difference was not significant. The 
largest differences between means were between each e3q>eri- 
mental group and the control (SAC) group. According to t- 
tests, all these differences were significant at the .001 
level. Also, the difference between the LCS mean and the LCE 
mean was significant at the .05 level, while the difference 
between the LCS and NOA-LCE means was not significant. There 
is no obvious feature of the experimental curriculum that 
provides an explanation for this pattern of scores. 

On the basis of analysis of variance, the null hy- 
pothesis was rejected. Analysis of covariance was not appro- 
priate because the regression lines were not parallel. Figure 
9 depicts the non-parallel nature of the regression lines. 



185 



SAC • San Antonio Curriculum 
ICS » Language-Cognition Spaniah 

UIZ = Language-Cognition English 
NQA-LCE s Non-Oral-Aural/Language- 




70 80 90 100 110 



Intelligence Quotient 



Figure 9. Least-squares Regression Lines for Each of the 
Four Groups (Grade Three, Year Four, 1967-68, San Antonio) 
Indicating the Relationship Betvreen The Lorae-Thomdike 
Intolliaence Tosta Scores and the Iowa Teats of BaaiTTkil 
(Capitalization Subscaie) Grade Equivalence Scores. — These 
lines Mrs extended approximately two standard deviations 
from the Grand Mean IQ. 
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Figure 9 shows chat at IQ's above about 90, the 
typical ordering of scores occurs again. From highest to 
lowest they are: SAC, NOA-LCE, ICE and LCS. The differ- 
ences are quite pronounced. Below 90, no generalization 
can be made about the scores of the foxir groups. 

Language Skills — Punctuation ; The results of analy- 
sis of ^^lance revealed that there was a signif iccuit dif- 
ference among the four treatment groups on the scores of the 
Punctuation subtest (p = .001). On this basis the null hy- 
pothesis was rejected. The mean scores were: SAC — 4.0: 
ICE— 3.3: NOA-LCE — 3.2 and LCS — 2.9. By applying t-tests to 
each pair of means, it was found that the differences between 
the SAC mean and the mean of each other group were significant 
at the .01 level or higher. The difference between the LCE 
and LCS means was significant at the .05 level. The differ- 
ences between the NOA-LCE and both the LCE and XCS groups 
were non- signif icant. There is no obvious explanation for 
this pattern of scores or this particular test. 

On the basis of analysis of variance, the null hypoth- 
esis was rejected. Controlling for IQ by using analysis of 
covariance was not appropriate since the assun5)tion that the 
regression lines be parallel was not met. 

Figure 10 shows the non-parallel nature of the re- 
gression lines. Above an IQ of 80, children achieved higher 
scores if they were in the SAC or NOA-LCE treatment groups. 
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Figure 10. Least-squares Regression Lines for Each of the 
pour Groups {Grade Three. Yesr Pour, 1967-68, San Antonio) 
Indicating the Relationship Between The Lorqe-Thorndike 
Intelli gence Teats Scores and the Iowa Teats of Basic Skills 
(Punctuation Subscale) Grade Equivalence Scores. These lines 
were extended approximately two standard deviations from the 
Grand Mean IQ. 
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Language Skills — Usage : The null hypothesis was 
rejected on the groxands that, both with analysis of variance 
and analysis of covariance, significant differences were found 
among the groups on the Usage subtest scores (p = .001). The 
mean scores were: SAC— 3.8; LCE— 3.0; NOA-LCt ..id ICS-- 

2.6. The differences between each of the escperimental group 
means and the SAC group mean were significant at the .001 
level. The differences between the LCE mean and the NOA-LCE 
and ICS means were significant at the .05 and .01 levels, 
respectively. The LCS and NOA-LCE mean scores^ were not sig- 
. nificantly different. 

Figure Ix depicts the results of analysis of covari- 
ance. When the scores were adjusted to account for lO, the 
order of the groups remained the same except that the differ- 
ences between the LCE and NOA-LCE scores disappeared. 
These findings provide evidence that the LCE and LCS treat- 
ments did not produce greater growth than the regular curric- 
ulum in language usage as measured by the Iowa Tests of Basic 
Skills . 

Language Ski lis— Total ; The results of both analysis 
of variance and of covariance showed that there were signifi- 
cant differences in the mean scores on Language Skills — Total 
among the four groups (p = .001). The mean scores were: 
SAC— 3.8; I/:e— 3.1; NOA-LCE— 3.0 and LCS— 2.7. According to 
t-tests, the differences between the means of each eaqjeri- 
mental treatment group and the SAC mean were significant at 
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Figure 11. Least- squares Regression Lines for Each of the 
Pour Groups (Grade Three, Year Four, 1967-68, San Antonio) 
Indicating the Relationship Between The Loroe-Thomdike 
Intelligence Tests Scores and the Iowa Tests of Basic Skills 
(Usage Subscale) Grade Equivalence Scores. These lines %#ere 
extended approximately two standard deviations from the 
Grand Mean IQ. 
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the .001 level. The difference between the LCE and LCS means 
was significant at the .05 level. The NOA-JjCE mean was not 
significantly different from either the LCE or the LCS mean. 

Figure 12 presents graphically the results of analy- 
sis of covariance. When the scores were adjusted for IQ, 
the LCE group dropped from second place to third place, so 
again the children in the ejqierimental language groups re- 
ceived the lowest scores. 

The null hypothesis was rejected on the basis of both 
analysis of variance and of covariance. Language skills as 
measured by a written test, the Iowa Tests of Basic Skills , 
were not developed differentially by the LCE and LCS treat- 
ments . 

Work-Studv Skills — Map Reading ; According to analysis 
of variance there were significant differences among the four 
groups on the Map Reading subtest (p = .001). The SAC group 
mean was highest: 3.5. The three experimental group means 
were about the same: LCS— 3.0; LCE and NOA-LCE— 2.9. T- 
tests showed that the differences between the SAC mean and 
the NOA-u:e, ix:e and LCS means were significant at the .001, 
.001 and .05 levels, respectively. There were no significant 
differences between each combination of LCE,. LCS and NOA-LCE 
group means. On the basis of analysis of variance, the null 
hypothesis was rejected. 

Analysis of covariance was not appropriate because 
the regression lines were not parallel. Figure 13 displays 
the non-parallel nature of the regression lines. At very 
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Figure 12. Least-squares Regression Lines for Fa^^h 
Four Groups (Grade Three, Yeir ftour? isevfea sin L^Lf^t 
indicating the Relationship Bet^en' Tje Lra;-^"r^fg2 '°^ 
Intelligence m-^r^ Scores and the Io wa Testa of gv s 1 1 , 
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low IQ levels, the experimental language treatments (LCE and 
ICS) were superior. Above an IQ of 90. the regular San Antonio 
curriculvim and NOA-LCE treatment were superior. 

Work-Study Skills — Reading Graphs and Tables : The 
results of analysis of variance showed that there were sig- 
nificant differences among the four groups on the Graphs sub- 
test (p = .002). On the basis of analysis of variance, the 
null hypothesis was rejected. The mean scores were: SAC— 
3.4; NOA-LCE— 3.1; LCE— 3.0 and LCS— 2.8. Again the differ- 
ences between the SAC means and those of the NOA-LCE, LCE 
and LCS groups were significant according to t-tests. The 
probability levels were .05, .05 and .01, respectively. The 
differences between the NOA-LCE and the LCE groups, between 
the NOA-LCE and LCS groups, and between the LCE and LCS groups 
were not significant. 

According to analysis of covariance, there were sig- 
nificant differences among the groups on the scores of the 
Graphs and Tables subtest. Figure 14 presents graphically 
the results of analysis of covariance. When the scores were 
adjusted for IQ, it was found that the SAC and NOA-LCE groups 
scored the same and the LCE and LCS groups received almost 
equal scores about three months lower than the other two ■ 
groups. Apparently the experiences involving graphing in 
the science lessons were not effective in developing superior 
skill in using graphs in the LCE and LCS groups as measured 
by the Iowa Tests of Basic <5lcm.Q- 
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Work-Study Skills — Knowledge and Use of Reference 
Materials; Analysis of variance found significant differ- 
ences in the scores on the References subtest (p = .007). 
On the basis of analysis of variance, the null hypothesis was 
rejected. The mean scores were: SAC — 3.4; LCE — 3.1; NOA- 
1J2E — 3.1 and LCS — 2.9. T-tests revealed that there were no 
significant differences between each pair of means for the 
e3q)eriinental treatment groups. The difference between the 
SAC mean and NOA-LCE mean was significant at the .05 level, 
as was the difference between the SAC mean and the LCE mean; 
the difference between the SAC mean and the LCS mean was sig- 
nificant at the .01 level. 

Analysis of covariance was not appropriate because 
the regression lines were not parallel. Figure 15 depicts the 
non-parallel nature of the regression lines. At IQ's above 
100, the SAC and NOA-LCE treatments were superior to the LCE 
and UlS treatments. Below 100, the interaction was so grsat 
that it is iirpossible to generalize. 

Work-Stu dy Skills— Total ; According to analysis of 
variance there were significant differences in the scores 
among the four groups. Therefore the null hypothesis was 
rejected. The mean scores were: SAC — 3.5; LCE — 3.0; NO;i- 
Ix:e — 3.0 and I/:s— 2.9. According to t-tests, the differences 
between the SAC group mean and the mean of the NOA-LCE, LCE 
and LCS groups were significant -at the .01 level or higher. 
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Figure 15. Least-squares Regression Lines for Each of the 
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Intelligence Tests Scores and the Iowa Tests of Basic Skills 

(Knowledge and Use of Reference Materials Subscale) Grade 

Equivalence Scores. These lines were extended approximately 
two otandard deviations from the Grand Mean IQ. 



197 

The differences between the means of the LCE and NOA-LCE 
groups, between the LCE and LCS groups, and between the NOA- 
LCE and LCS groups were not significant. 

Analysis of covariance was not appropriate because 
the regression lines were not parallel. Figure 16 depicts 
the non-parallel nature of the regression lines. As happened 
before at the higher IQ levels, the order of the groups from 
highest to lowest was: SAC, NOA-LCE, LCE and LCS. Below 
about an IQ of 90, the interaction prevents any generaliza- 
tion from being made. 

Arithmetic Skil ls — Arithmetic Concepts ; Ttoalysis of 
variance found significant differences among the groups on 
the scores of the Arithmetic Concepts subtest '(p = .001). 
The null hypothesis was rejected. The mean scores were: 
SAC— 3.6; IX:E~3.2: NOA-LCE— 3.1 and LCS— 3.0. Again the 
differences between the SAC group mean and the mean of each 
of the experimental groups were significant. The probability 
levels were .01 for the NOA-LCE and LCE groups and .001 for 
the LCS group, when the experimental gror.p means were com- 
pared, no significant differences were found. 

Analysis of covariance was performed but no signifi- 
cant differences were found (p = . 076 ) . Therefore the null 
hypothesis was accepted. 

Arithmetic Skill s — Arithmetic Problem Solving : There 
were significant differences among the groups on the scores 
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on Arithmetic Problem Solving according to analysis of vari- 
ance (p = .005). Therefore the null hypothesis was rejected. 
The highest score was obtained by the SAC group, 3.6; the 
lowest by the LCS. 2.9. According to t-tests. the difference 
between these two scores was significant at the .001 level. 
The LCE and NOA-LCE groups scored 3.3. These scores were not 
significantly different from the SAC mean, but each was sig- 
nificantly different from the LCS mean at the .05 level. 
Analysis of covariance found no significant differences (p = 
.084) so the null hypothesis could not be rejected on the 
basis of analysis of covariance. 

The two Arithmetic Skills subtests are the only two 
tests for which analysis of covariance was appropriate and 
for which significant differences were not found. In all 
other cases the SAO group obtained the highest mean scores or 
were equal to the highest. This difference in the pattern of 
scores means that it is possible that the exercises involving 
mathematics within the AAAS materials were effective in pro- 
ducing relatively greater growth in arithmetic skills. 

Arithmetic S kills— Total : According to analysis of 
variance there were significant differences in the mean scores 
of the four groups (p = .001). On the basis of analysis of 
variance, the null hypothesis was rejected. The mean scores 
were: SAC— 3.6; Ix:e-3.2; N0A-LCE.--3. 2 and LCS-2.9. The SAC 
mean score was significantly different from the NOA-LCE, LCE 
and LCS means at the .05, .01 and .001 levels, respectively. 
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The mean of the ISO^-ICE group was not significantly differ- 
ent from the iCE and LCS group means, but the LCE group mean 
was significantly different from the LCS mean at the .05 
level. 

According to analysis of covariance, there were sig- 
nificant differences when IQ was controlled (p = .030). 
Figure 17 depicts the results of analysis of covariance. When 
the scores were adjusted for IQ, the NOA-LCE scores were 
almost as high as those of the SAC group. The order of the 
other groups remained the same. The null hypothesis was 
rejected. 

Composite: Analysis of variance applied to the Com- 
posite scores revealed significant differences among the 
groups' scores (p = .001). As with all the other tests, the 
SAC group obtained the highest mean: 3.5. Even this mean 
score is somewhat below that obtained by the typical child in 
the third grade at that time of year: 3.8. The other mean 
scores were: LCE— 3.0 j NOA-LCE— 3.0 and LCS— 2.7. As before, 
the differences between the SAC mean and the means of each 
other group were significant. In each case, the probability 
level was .001. The LCE and NOA-LCE mean scores were not 
significantly different from each other. The means of both 
the LCE and NOA-Ix:e groups were significantly different from 
the LCS mean at the .05 level. 

Analysis of covariance was not appropriate because 
the regression lines were not parallel. Figure 18 portrays 
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the non-parallel nature of the regression lines. Above about 
an IQ of 85, the SAC and NOA-LCE treatments were clearly 
superior to the LCE treatment which was in turn superior to 
the LCS treatment. 

In summary, according to analysis of variance, there 
were significant differences among the groups on IQ as mea- 
sured by The Lorae-Thorndike Intelligence Tests and on every 
test of the Iowa Tests cf Basic Skills . As can be seen on 
Table 9, the probability values are less than .01 in all 
cases. The control (SAC) group always obtained the highest 
mean scores. Except on the Reading Maps sxabtest, the LCS 
group always received the lowest mean scores. The mean scores 
of the LCE and NOA-LCE groups were between the mean scores of 
the SAC and I£:s groups. Neither the LCE nor the NOA--LCE 
group was consistently superior to the other. Thus the fol- 
lowing generalization describes quite accurately the differ- 
ences between the four groups when IQ was not taken into 
consideration: SAC;?'LCE = NOA-LCE >^LCS. 

Given the significant differences in IQ scores, it 
was unfortunate that analysis of covariance was appropriate 
for only eight subtests of the Iowa Tests of Basic Skills . 
Significant differences were found in six of these cases: 
Reading Con^jrehension (p « .014), Spelling (p = .001), Usage 
(p a .001), Language Skills—Total (p « .001), Reading Graphs 
and Tables (p » .028) and Arithmetic Skills— Total (p « .030). 
The adjusted mean scores of the SAC group were always the 
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highest except in one case (Reading Graphs and Tables) when 
they were equalled by the scores of the NOA-LCE group. This 
latter group had the second highest adjusted mean scores in 
all other cases except on the Usage subtest on which the LCE 
mean score was higher. The adjusted mean scores of the LCE 
and ICS groups had the third and fourth highest scores in 
five of the six cases where there were significant differences 
according to analysis of covariance. The LCS group mean 
scores never rose above third highest. No significant dif- 
ferences were found for Arithmetic Concepts and Arithmetic 
Problem solving subtests. The results according to analysis 
of covariance may be summarized generally, limited to the 
significant differences noted above, as: SAC = NOA-LCE > 
LCE = LCS. 

The figures depicting the regression lines for analy- 
ses of covariance for the Vocabulary, Capitalization, Punc- 
tuation, Map Reading, Knowledge and Use of Reference Materials, 
Work-Study Skills— Total and Composite subscales seem to indi- 
cate that the ICE and LCS treatments were more effective for 
children with lower IQ's while children with higher IQ's 
performed better in treatments containing a limited amount of 
oral language instruction. 

After the analyses of variance and covariance were 
performed, one additional corrparison was made. For each sub- 
scale of the Iowa Tests of Basic Skills the mean score of the 
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third grade ICE group was compared to that of the fourth 
grade iCE group and the difference confuted. The same thing 
was done for the ICS and SAC groups. Table 12 lists the mean 
scores for each treatment and the differences between them. 

The interpretation of the comparisons of the third 
and fourth grade scores is based on the assuitption that the 
only difference between the children in the sairple of the two 
grades is one additional year in school, in other words, 
they need to have begun first grade with about the same 
characteristics and abilities and to have had about the same 
experiences since first grade in order for the conparisons to 
be valid. Data were not available to verify the assxarrption. 
It did happen that the IQ scores for the two grade levels in 
each treatment were very similar. 

The scores shown on Table 12 are grade equivalent 
scores. The numeral before the decimal represents years and 
the one after the decimal represents months. Thus, as the 
table shows, the difference between the third and fourth 
grade mean scores for the LCE treatment ranged between seven 
months for Vocabulary and one year and five months for 
spelling. The scores for all the tests except Spelling dis- 
play differences of at least nine months, which is the normal 
length of the school year. If these differences can be in- 
terpreted as a reflection of growth, then the growth is quite 
satisfactory. 
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The differences between the third and fourth grade 
mean scores for the LCS treatment ranged from six months for 
Arithmetic Concepts to one year and two months for Usage. 
The differences overall appear to be a little less than those 
of the LCE treatment, but nevertheless they reflect fairly 
adequate growth. 

The differences for the LCE and LCS treatments are 
in striking contrast to those of the SAC group. These dif- 
ferences range between zero months (for three tests) and six 
months (for two tests). 

Upon analyzing data derived from her language test, 
Taylor found negative differences in Fluency and Total scores 
between the fourth grade and fifth grade control groups in 
Year Five but positive differences for the LCE and LCS 
groups. The children in Taylor's study in the LCE and LCS 
groups were the same children as those in this study minus 
the ones transferred to non-experimental classes. Her con- 
trol groups consisted of different children who came from a 
«chool not in the Project but with a population very similar 
to that of the Project schools. 

Taylor's findings with regard to language in con- 
junction with the ones in this study offer some evidence for 
the following hypothesis: The English language skills of 
native Spanish-speaking children in classrooms where no 



^Taylor, "Year Five Findings/' p. 110. 
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intensive language training is offered will develop to a 
point and then reach a plateau. This plateau in language 
development then limit ^_growth in academic skills such as 
those measured by the Iowa Tests of Basic Skills. 
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CHAPTER IV 

SUMMARY, LIMITATIONS, CONCLUSIONS, 
DISCUSSION AND RECOMMENDATIONS,. : 

SUMMARY 

This Study attenpted to evaluate the effects on pupil 
performance of three and four years of intensive oral lan- 
guage instruction in English or Spanish. The subjects were 
disadvantaged third and fourth grade Mexican-American chil- 
dren who had been members of e3q>erimental classes since 
first grade in the San Antonio Language Research Project. 
The original goal of the Project was to enhance reading 
readiness. As the experimental treatments extended into 
higher grades the objective changed to developing skills in 
reading ^d in other academic subjects. 

Originally there were three treatments. Two treat- 
ments included intensive language instruction, one in English 
(OAE — oral-aural English) and one in Spanish (OAS — oral-aural 
Spanish), and used science exercises from the AAAS Science; 
A Process Approach as the content base. Later, these treat- 
ments were referred to as the LCE (Language-Cognition English) 
and ix:s (Language-Cognition Spanish) groups. Children in the 
initial third treatment, known as NOA (non-oral-aural), par- 
ticipated in the) science activities but received no inten- 
sive language instruction. After the tliird year (1966-67) 
the NOA treatment, which had been considered the "control" 
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group, was dropped by the Project administrators. These 
children were then assigned, insofar as possible, to the 
English and Spanish escperimental classes. 

The attempt was made in this study to include in the 
data analysis scores only from children who had been in the 
same type of experimental treatment for all their years in 
school. The one exception made was to include children in 
the e3q)erimental English classes Tn^third grade who had been 
in NOA classes during the first and second grades. This 
group was referred to as the NOA-LCE group in this study. 

Because the original NOA treatment had been elimi- 
nated, another "control" group was selected, which consisted 
of children in the sa 2 schools but who had never been in 
escperimental classes. They received only the regular San 
Antonio curriculum (not defined, and thus were referred to 
as the SAC group. It was assumed that any differences that 
might occur between the scores of the "control" group and 
the esqserimental groups could be attributed to the experi- 
mental curriculum because the children in the Project 
schools were to have been assigned randomly to experimental 
or non-experimental classes when they entered first grade. 

The table below shows the number of children in- 
cluded in the research saitple: 
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Treatment Group Grade Four Grade Three 

ICE (initially OAE) 29 109 

NOA-LCE — 59 

LCS (initially OAS) 60 40 

SAC 21 68 

For the purposes of this study it was assumed that 
the children receiving special instruction in oral language 
had improved in language ability. The question became one 
of investigating the effect of inqproved language abilities 
on reading and abilities in other academic subject areas. 
The instrianent used to measure these abilities was the lowa 
Tests of Basic Skills which is a written group test mea- 
suring skills in vocabulary, reading comprehension, language, 
work-study and arithmetic. The non-verbal battery of The 
Lorqe-Thomdike In telligence Tests was used to estimate dif- 
ferences in learning capacity. 

Analysis of variance was used to conpare the mean 
scores from the spring administration of the Iowa Tests of 
Basic Skills for the three treatment groups at the fourth 
grade level and the four treatment groups at the third grade 
level. Analysis of covariance also was used to compare the 
mean scores. The purpose of this analysis was to control 
for IQ, as measured by The Lorae-Thorndike Intelligence Tests 
administered to children in both grades in the fall of the 
year they were in third grade. 
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Findings 

Fourth Grade 

According to analysis of variance (see Table 8), 
there were significant differences among the three treatment 
groups only on Language Skills— Spelling (p<.004), Work- 
Study Skills— Total (p<.648), Arithmetic Skills—Arithmetic 
Concepts (p<.046) and Arithmetic Skills — Total (p<.046). 
In all cases, the LCE group had the highest mean score and 
the LCS group the lowest mean score. This generalization, 
based on analysis of variance ^ describes the differences 
which tended to occur: LCE^SAC p'LCS. 

When analysis of covariance with IQ as the covariable 
was applied, the differences listed above disappeared and two 
others occurred. Significant differences were found in 
Language Skills— Usage (p 41. 044) and Language Skills— Total 
(p <. 045 ) . Corrparisons of adjusted means on these two scales 
revealed that the u:e mean scores were highest, the SAC mean 
scores the lowest and the LCS mean scores close to but higher 
than the SAC scores. Therefore, this generalization is ap- 
propri ate : LCE LC S = SAC . 

Examination of three figures (1, 2 and 5) displaying 
the results of analysis of covariance indicated that children 
with lower IQ's, as measured by The Lorqe-Thorndike Intelli- 
gence Testa, benefitted from being in the LCE or LCS treat- 
ments while children with higher IQ's performed better if 
they were in the SAC group. 
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Table 13 provides a siammary of the relationships of 
the mean scores of the treatment groups in the fourth grade 

Iowa Test s of Basic Skills according to analysis of 
variance and analysis of covariance. No scores are reported 
as this was done previously in Tables 8 and 9. Table 13 
lists the groups according to the order of their means. 
Therefore, if a relationship reads LCE = SAC = LCS, this 
means that according to analysis of variance or of covariance 
there were no significant differences, but the LCE group 
mean score was highest, the SAC second, and the LCS group 
mean lowest. If a relationship includes only equal signs, 
that means that, according to analysis of variance or of 
covariance, there were no significant differences among the 
groups. In every case where tht_e were significant differ- 
ences among the groups according to analysis of variance , 
t-tests were applied to each pair of groups. Inequality 
signs in the middle column indicate the results of the 
application of t-tests. In the right column inequality signs 
were obtained by visual examination of the figures display- 
ing the regression lines taken from analysis of covariance 
in the cases where significant differences had been found. 

Third Grade 

According to analysis of variance (see Table 9), 
there were significant differences on every test of the 
Iowa Tests of Baair .gvinQ. The SAC group always obtained 
the highest mean scores. The LCS group received the lowest 
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Table .13 

Summary of ;tesults of Analysis of Variance and Analysis 
or CovaraaTice, San Antonio Language Research Project, 
Grade Four, Year Four (1967-68) 

BasL'^Siflf 7~~- I 1 ^ lysis of Covariance 

Basic Skills Analysis of Variance-'- with 10 as Covariable^ 



Vocabulary 

Reading 

Spelling 

Capitalization 

Punctuation 

Usage 

Total Language 
Maps 
Graphs 
References 
Total Work-Study 
Arith. Con. 
Arith. Prob. 
Total Arith. 
Coirposite 



SAC=LCE=LCS^ 
SAC=LCE=LCS 

LCE=SAC=LCS (LCE>LCS) 

LCE=SAC=LCS 

LCE=SAC=LCS 

LCE=SAC=LCS 

LCE=SAC=LCS 

LCE=SAC=LCS 

LCE=SAC=LCS 
LCE=SAC=LCS 
LCE=SAC=LCS (LCE>LCS) 
LCE=SAC=LCS (LCE>LCS) 
LCE=SAC=LCS 
LCE=SAC=LCS (LCE>LCS) 
LCE=SAC=LCS 



Analysis inappropriate 
LCE=SAC=LCS 

Analysis inappropriate 
LCE=LCS=SAC 
LCE=LCS=SAC 
LCEi»LCS:>SAC 

lce:^lcs=sac 

lce=sac=lcs 

lce=sac=lcs 

lce=sac=lcs 

lce=sac=lcs 

lce=sac=lcs 

lce=sac=lcs 

lce=sac=lcs 

Analysis inappropriate 



(r^y of?^Si^^^i? inequality signs reflect application of t-tests 
igfl'Sfi^r^^ analysis of variance. Equal signs indicate 

that differences m mean scores were not statistically significant. 

Pi^ur^o'^^ii^^A^*^ inequality signs reflect visual examination of 
Figures 3 and 4 as well as analysis of covariance. 
3 

The groups are listed according to the order of their mean 
scores, with the highest first. 
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mean score on every test except the Map Reading subtest. The 
mean scores of the LCE and NOA-LCE groups were in the middle 
with the one exception noted before, with neither group 
being consistently superior to the other. Based on analysis 
of variance, this generalization describes the differences 
between the four groups: SAC>LCE = NOA-LCE >LCS. 

Analysis of covariance was not always appropriate 
because the assuirption that the regression lines be parallel 
was not met in every case. Of the eight instances when 
analysis of covariance was performed, significant differences 
were found in six: Reading Con5>rehension (p.^.014). Lan- 
guage Skills—Spelling (p^L.OOl), Language Skills—Usage 
(p^.OOl), Language Skills — Total (p.^.001), Wbrk-Study 
Skills— Reading Graphs and Tables (p^.028), and Arithmetic 
Skills — Total (p-i:.030). Examination of the adjusted mean 
scores of the four treatment groups on these six tests indi- 
cated that a fcd.r generalization would be: SAC = NOA-LCE?? 
LCE = LCS. 

Observation of six of the figiires (6, 9, 10, 13, 
15, 16 and 18) displaying the results of analysis of covari- 
ance indicated that children with lower iQ's benefitted from 
being in the LCE or LCS treatments while children with 
higher IQ's performed better if they were in the SAC or 
NOA-LCE groups. 

Table 14 provides a summary of the mean scores of 
the treatment groups in the third grade on the Iowa Tests 
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Table 14 

Summary of Results of Analysis of Variance and Analysis 
of Covariance, San Antonio Language Research Project, 
Grade Three, Year Four (1967-68) 



, Analysis of Covariance 
Basic smils Analysis of Variance-*- with IQ as Covariable^ 



Vocabulary 

Reading 

Spelling 

Capitalization 

Pvjnctuation 

Usage 

Total Language 

Maps 
Graphs 

References \ 
Total Work-Study 
Arith. Con. 
Arith. Prob. 
Total Arithmetic 

Composite 



SAC > 140A-LCEs:LCE=LCS^ 
(NOA-LCE>LCS) 

SAC > NOA-LCE=IiCE> LCS 

SAC >NOA-LCE=LCE=LCS 

SAC > LCE=NOA-LCE=LCS 
(LCE=>LCS) 

SAC >LCE=NOA-LCE=LCS 
(LCE> LCS) 

SAC i» LCE > NOA-LCE=LCS 

SAC LCE=NOA-LCE=LCS 
(LCE2>LCS) 

SAO LCS=LCE=NOA-LCE 

SAO NOA-LCE=.'jCE=LCS 

SAO LCE=N0A-LCE=LCS 

SAC> LCE=N0A-LCE=LCS 

SAC^ LCE=N0A-LCE=LCS 

SAC=NOA-LCE=LCEi» LCS 

SAC> LCE=NOA-LCE=LCS 
(LCE> LCS) 



Analysis inappropriate 

SAC=NOA-LCE> LCE=LCS 
SAC=NOA-LCE> LCS=LCE 
Analysis inappropriate 

Analysis inappropriate 

SAO> LCE=NOA-LCE=LCS 
SAC2fcN0A-LCE> LCE=LCS 

Analysis inappropriate 
SAC=NOA-LCE ^ LCS=LCE 
Analysis inappropriate 
Analysis inappropriate 
SAC=NOA-LCE=LCS=LCE 
NOA-LCE=SAC=LCE=LCS 
SAC=NOA-LCE> LCE^LCS 



ERIC 



SAC>N0A-1XE»LCE> LCS Analysis inappropriate 

tr.^ nf^^^^-,^ inequality signs reflect aptjllcation of t-tests 

' analysis of variance. EqJU sicma indicate 
that differences xn mean scores wore not statiaticaily significant. 

of pii^™^7^^fl^?i^''??"^H^y f^^""^ reflect visual examination 
of Pxgures 7, 8, 11, 12, 14 and 17 as well as analyfis of 
covari ance . * 

^The groups are listed according to the order of their mean 
scores, with the highest first. 



of B^Qic Skills according to analysis of variance and analy- 
sis of covariance. The format of this table is similar to 
the one for Table 13 which is explained on page 213. 

Long-term Pattf>rna 

In order to place the findings of the Year Pour 
(1967-68) study in perspective, summaries for the fourth 
grade (Sart$)le I) and third grade (Samplt II) sxibjects are 
outlined in Tables 15 and 16. The findings related to the 
"control" group in Year Two (1965-66) are omitted because 
the selection of the members of that group violated the 
usual research design procedures, i.e.. the control pupils 
were not similar to the experimental pupils in ethnicity, 
social class, academic achievement and language ability. 
(However, it should be noted that the inclusion of that 
"control" group in the Year Two (1965-66) study did reveal 
the standing of the escperimental groups in relation to the 
total community. ) The nomenclature of the Year Four study 
is used, i.e., intensive English treatment * LCE and in- 
tensive Spanish treatment = LCS. 

Sample I 

The summary of findings related to the fourth grade 
research aaiaple reveals that: 

(1) At the end of first grade, there were no dif- 
ferences among the groups on 'reading readiness; 



Table 15 



Summary of Findings Related to Sanple I (children 
who entered first grade in 1964) in the San 
Antonio Language Research Project at the 
End of Year Four (1967-68) 



First Grade (Horn, "Year One Findings") 
Criterion: Reading Readiness 
NOA = LCE = LCS 

Second Grade (Arnold, "Year Two Findings") 
Criterion: Reading 
NOA > LCE = LCS 

Third Grade (Knight, "Year Three Findings") 
Criterion: Reading 
nine significant differences— 

NOA> LCE > LCS 
six significant differences — 
lce>noa:?-lcs 

Fourth Grade (Salinas, "Year Four Findings") 
NOA group discontinued 

Criterion: Iowa Tests of Basic Skills 

analysis of variance - 

LCE>SACVLCS 

analysis of covariance with IQ as covariable 

LCE^ LCS ^SAC 

NOA = Non-Oral-Aural 

LCE = Language-Cognition English 

I^S SB Language-Cognition Spanish 

SAC = San Antonio Curriculum (control) 



Table 16 



Sumary of Findings Related to Sanjjle II (children 
who entered first grade in 1965) in the San 
/^ntonio Language Research Project at the 
End of Year Four (1967-68) 



First Gra de (Arnold. "Year Two Findings") 
Criterion : Reading 
LCE>NOA = ICS 
Criterion: Reading Readiness 
high pretest scores — 

LCE = NOA = LCS 
low pretest scores-- 
LCE = NOA > LCS 

Second Grad e (Knight, "Year Three Findings") 
Criterion: Reading 
LCE>N0A;>'LCS 

Third Grade (Salinas, "Year Four Findings") 
Criterion; Iowa Tests of Basic Skills 
analysis of variance~ 
SAO" LCE = NOA-LCE;^LCS 

analysis of covariance with IQ as covariable 

SAC = NOA-LCE>LCE = LCS 

NOA = Non-Oral-Aural 

tCE = Language-Cognition English 

ICS = Language-Cognition Spanish 

N0A-K:E = NOA for two years, LCE in third grade 

SAC = San Antonio Curriculum (control) 
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(2) At the end of second grade, the NOA group was 
superior to the LCE and u:s groups in reading while the LCE 
and LCS groups were about the same; 

(3) At the end of third grade, nine significant 
differences on a reading measure favored the NOA group and 
six favored the LCE group. The LCS group was always third 
in order: and 

(4) At the end of fourth grade the LCE group had 

a slight advantage on the Iowa Teats of Basic skills Whether 
or not IQ was controlled. When IQ was not considered, the 
control (SAC) group had higher scores than the LCS group. 
When IQ was considered, the LCS group scored slightly higher 
or about the same as the control (SAC) group. 

In Sani)le I a trend appeared to be developing up 
through third grade which favored the NOA treatment. Un- 
fortunately, the NOA treatment was discontinued after third 
grade so there is no way of knowing for sure whether or not 
the LCS group would have scored highest on the Iowa Tests 
of Basic Skilla if the NOA group had been included in the 
conparison. Although the NOA pupils received special in- 
struction with science materials, they probably received 
approximately the same kind of oral language instruction as 
the "control" pupils, e.g., no planned formal language in- 
struction at »11. Therefore, it is assumed here that the 
NOA group of the first three years was similar to the 
"control" group of the fourth year. It follows then that 
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the trend favoring the groups not receiving the eaqjerimental 
language curricula was reversed in fourth grade and the LCE 
treatment proved slightly more effective. 

Sample II 

The summary of findings related to the third grade 
research sairple in Year Four shows that: 

(1) At the end of first grade, when reading readi- 
ness was measured, the LCE, LCS and NOA groups were about 
the same except when scores of pupils with low pretest scores 
were conpared. In this case, the LCS group scored lower than 
the other two groui)S. When reading skills were measured, the 
IX2E group was superior to the NOA and LCS groups, which were 
about the same. 

(2) At the end of second grade, tests of reading 
showed that the LCE group was superior to the NOA group, 
which was in turn superior to the LCS group. 

(3) At the <2nd of third grade, the "control" group 
scored highest on tho Iowa Tests of Basic Skills . There 
were no consistent di.fferences between the LCE and NOA- LCE 
groups. The LCS group scored lowest. When IQ was controlled, 
the NOA-IX:e scores were found to be greater than those of the 
LCE group and about the same as the SAC scores. The LCS 
scores were about the same as the LCE scores. 

For the firot two years, the LCE treatment appeared 
to be slightly more effective than other treatments for the 
children in Sample II. The test scores in the third grade 



did not provide evidence that the LCE treatment was more 
effective than the language activities (not defined) in the 
regular San Antonio curriculum. Thus, for the first time, 
pupils in the intensive language programs who had teachers 
who were in their second year of experience with the new 
program, did not out-perform pupils receiving no special 
language instruction. 

LIMITATIONS 

Instnunentation 
Performance on the Iowa Tests of Basic Ski lis de- 
pends heavily on the ability to read. On every subtest, 
even the arithmetic sections, the child has to read the 
questions before he can display the slcills that are sup- 
posed to be assessed in that section of the test. The 
mean scores of the fourth grade groups ranged between 3.5 
(third grade, fifth month ^ and 3.8. Those of the third 
grade experimental groups ranged between 2.6 and 2.9, while 
the SAC group mean was 3.3. These low reading levels indi- 
cate that the children in the research sanple were likely 
to have been unduly handicapped by their reading abilities 
on the tests assessing skills in other areas. 

Adding to the problem is the fact that both the 
children and their teachers, when faced with a test that 
appears to them to be very difficult, do not seem to take 
the testing sessions seriously. Thus the children in the 
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research sainples may not have performed as well as they might 
have if they had had more confidence in themselves and in the 
process of taking exams. It may be true, however, that these 
factors relating to the difficulty of the instrumentation 
used in this study affect all treatment groups about the same. 

Sample Size 

By the fourth year (1967-68) of the Project, the 
sainple sizes were reduced to as few as twenty-nine in the 
fourth grade LCE treatment group compared to the 250 chil- 
dren in each treatment group in 1964. The number of chil- 
dren in eacperimental classes was still large, but the number 
who had received the same treatment for three or four years 
was much smaller. Unfortunately there was no way of knowing 
whether the children in the research sample in Year Four 
fairly represented the spectrum of abilities of the original 
sample. It is impossible to say whether or not the effects 
of the intensive language instruction were the same on 
those children who advanced to the third or fourth grade in 
the same experimental treatment as the effects would have 
been on all the children who started receiving escperimental 
language instruction in first grade. 

Lack of Monitoring 
Because of lack of resources, the teachers were not 
supervised on a daily basis either by the Project staff or 
by the supervisory staff of the San Antonio Independent 
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School District. The teachers were not asked to report the 
amount of time they spent teaching the experimental curric- 
ulum but were assumed to follow project directions. There- 
fore, even though the e3q>erimental design called for instruc- 
tion in the special curriculum for an hour a day for 140 days 
a year, no one knows if these instructions were actually 
carried out. Observers suspect that there was great varia- 
tion in use of the experimental curriculum from classroom 
to classroom. 

Another source of variation occurred in the degree 
to which the teachers used the oral-aural techniques in 
lessons during the day. It is likely, though, that teachers 
in the Project for the first or second year would not have 
felt enough confidence to try developing their own language 
drills for use the rest of the day. Thia source of varia- 
tion might be more irrqportant in later years of research. 

Some deviation from the pleuined instruction occurred 
because production of materials was delayed. This is known 
to have most strongly affected the implementation of the LCS 
treatment and the fourth grade LCE classes the year they were 
in third grade. 

■Nature ' of the San Antonio Curriculum 
The conparisons made in the research in the Project 
generally referred to the regular san Antonio curriculvun. 
For instance, one of the three methods of developing reading 
readiness in the Year One (1964-65) study was whatever the 



225 

San Antonio Independent School District stipulated, in 
later years the intensive language programs were conpeired 
to whatever methods were used by the teachers in the non- 
experimental classes. Unfortunately, no one knows spe- 
cifically what methods the teachers in non-eacperimental 
classes used or even what methods were used by the teachers 
in the ICE and ICS classes during the part of the day not 
involving the language/science materials. It seems safe to 
assume that although teachers' methods and styles varied 
somewhat, aiidiolingual language teaching methods, with few 
exceptions, were not used outside of Project classrooms and 
probably little attention was paid to structured language 
development Egr se. A few of the NOA teachers may have 
occasionally used audiolingual techniques but they did not 
do so on a regular systematic basis. Probably none of the 
"control" teachers used them. 

^ ^ Assessment of Oral Language 

The objective of the San Antonio Language Research 
Project was to iirprove oral language abilities in order to 
enhance academic achievement of Spanish-speaking dildren. 
Because oral language assessment instruments which have 
general acciBptance were not available, no firm conclusions 
could be reached about the effects of the different language 
teaching methods used. The use of reading readiness tests, 
reading tests and the Iowa Tests of Baaie skilly was pre- 
dicated on the assunption that the language of the children 
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in the eaqjerimental classes had inproved. The data from the 
Vfritten tests would show whether or not the in5>roved lan- 
guage abilities would affect other kinds of school perfor- 
mance. The problem with this assumption and the lack of 
data on oral language daring every year of the Project is 
that if significant differences are not found on the reading 
and other tests, then one does not know whether the experi- 
mental curriculum failed to develop language or whether 
improved language has no effect on other skills. 

Assignment to Treatment Groups 
VJhen the Project was begun the school administrators 
agreed to assign pupils to treatment groups on a reindom 
basis. There is evidence that in a number of cases, pupils 
of lower age and less developed English abilities were 
assigned to the UlE classes. To the extent that the sta- 
tistical techniques used in the various studies did not 
account for these initial differences, the effects of the 
intensive English instruction may have been underestimated. 

Retention Policy 
The failure rate of first grade Spanish-speaking 
children had been very high in some schools in San Antonio. 
Principals of Project schools agreed to promote all children 
in the e3q)erim©ntal classes. In a very few cases this was 
not done. Data were eliminated in this study for any child 
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in the third grade LCE, LCS and NOA-LCE classes who had been 
retained. 

A more difficult problem arose in that it is not 
known whether or not the schools applied the new policy of 
no failures to the non-experimental classes, if the teach- 
ers maintained a policy of failing the lowest achieving 
children only in the SAC (control) classes, these classes 
would have children not conparable to those in thft experi- 
mental classes. This lack of similarity might be reduced 
by the extent to which the retainees of the next higher 
grade were members of the research sanple classes. 

Examination of the chronological ages of the children 
in the "control" classes indicated that at least one of the 
twenty-one fourth grade SAC pupils was overage for his grade 
and at least eight of the sixty-eight third grade SAC pupils 
were overage. There are other reasons besides failure for 
being overage but, whatever the ejcplanation , these overage 
children probably represent the lower spectrum of abilities 
or achievement levels. 

The interpretation of the conparisons of achievement 
of experimental and control pupils in this study is therefore 
limited by the lack of exact information on the composition 
of the control classes. 

... Teacher Variables 

By the fourth year of the Project, the several chil- 
dren in each treatment in the research sanples had received 
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instruction from many different teachers. It is hoped that 
this fact will mean that teacher variation did not unduly 
affect the data. 

In addition to the usual differences in teacher com- 
petency, other facwirs related to the Project contributed 
to teacher differences. Success which pupils would have had 
with the Spanish lessons depended in part on their teachers' 
mastery of Spanish. The teachers who participated in the 
Project had to spend much more time in preparation and in 
inservice meetings than non-Project teachers. This extra 
bxirden may have affected the teachers' attitudes and willing- 
ness to cooperate in varying degrees. The principals of 
the various buildings also differed in their willingness to 
cooperate. Finally, the Project staff supervisors apparently 
varied a great deal in the performance of their duties due. 
in part to different philosophical joints of view and in 
part to the erratic observation schedules, 

" - CONCLUSIONS 

Within the limitations set forth above, the data in 
this study allow these conclusions to be made about the 
effects on children in san Antonio Language Research Project 
schools at the conclusion of the Project's fourth year (1967- 
68) in the LCE, NOA-LCE and LCS treatments: 

(1) The ICS (Language-Cognition Spanish) treatment 
was not effective in producing differential achievement as 
measured by the Iowa Tests of Basic Skills. 
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(2) According to data from the fourth grade sanple, 
the uys. (Language-Cognition English) treatment was slightly 
more effective in producing differential achievement than 
language activities of the regular San Antonio curriculum. 
Significant differences favoring the LCE treatment were 
to'ond in comparisons of mean scores on the following tests 

Iowa Tests of Basic Skills t Language Skills— Spelling; 
Work-Study Skills— Total; Arithmetic Skills— Arithmetic con- 
cepts; and Arithmetic Skills— Total. When IQ was controlled 
the significant differences favoring the LCE treatment were 
found in Language Skills— Usage; and Language Skills—Total. 

(3) According to data from the third grade sanple, 
the SAC (San Antonio curriculum) was more effective in pro- 
ducing differential achievement as measured by the Iowa Tests 
of Basic Skills. When IQ was controlled, the NOA-LCE (Nbn- 
Oral-Aural/Language-Cognition English) treatment was also 
more effective than the eacperimental language treatments— 
LCE and LCS. 

(4) The superiority of the SAC treatment at only 
the third grade level may be an anomaly or it may represent 
a pattern. 

(5) On the subscales where analysis of covariance 
was not appropriate because of interaction effects, figures 
displaying the regression lines revealed a tendency for the 
SAC and NOA-I^E treatments to be more effective at the 
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higher IQ levels and for the LCE and LCS treatments to be 
more effective at the lower IQ levels. 

(6) The grade equivalent scores and the percentile 
rankings of the children on the Iowa Tests of Basic Skills 
in all the treatments— experimental as well as control—were 
very low. At the end of fourth grade the Composite mean 
scores of the Sample I children ranged between 3.6 (third 
grade, sixth month) and 4.0 (beginning of fourth grade). At 
the end of third grade the Coiiposite mean scores of the 
Sample II children ranged between 2.7 (second grade, seventh 
month) and 3.5 (third grade, fifth month). The percentile 
rankings ranged between the fourth and thirty-ninth per- 
centiles for children in fourth grade and between the third 
and sixty-second percentiles for children in the third grade. 

DISCUSSION 

Spanish Treatment Findings 
The most consistent set of findings in the Project 
research related to the ineffectiveness of the LCS treatment, 
on almost no measure did the LCS groups receive scores higher 
than the other experimental. or control groups. These results, 
however, should not be interpreted as an indictment against 
the use of Spanish in the clasaroom or against bilingual edu- 
cation. The major instruments used for assessment, both oral 
and written, were in English, not Spanish, 'rte data from the 
administration of the Spanish reading tests in Year Two 
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(1965-66) and Year Three (1966-67) were not favorable to the 
LCS treatment, but instruction in reading in Spanish was not 
a conponent of the e3<perimental or the regular curriculum. 

In addition to the lack of proper testing, certain 
shortcomings were seen in the Spanish materials which must 
have been factors in the effectiveness of the treatment. 
The Spanish lessons consisted of translations of the English 
lessons. Thus, they were not based, even at first and second 
grade levels, on any systematic outline of the syntactic 
structures of Spanish or on any other logical plan. More- 
over, the development of the materials was not preceded by 
any diagnosis of the language abilities of the pupils. It 
is also known that the Spanish curriculum materials fre- 
quently arrived in San Antonio late so that the teachers 
were not able to teach new material in the daily time 
allotted to their experimental treatment. This iacjc of 
curriculum materials was especially serious for the Spanish- 
surname teachers who were not fluent in Spanish (the Project 
administrators had erroneously assumed that all Spanish- 
surname teachers spoke Spanish). Therefore, the negative 
findings in this study with regard to the Spanish treatments 
are not surprising. 

English Treatment Findings 
It was assumed that if the experimental English cur- 
riculum were effective, there would be significant differences 
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favoring the u:e group on scores of the Iowa Tests of Basic 
Skills. Improved skills in language would be reflected in 
higher scores on the Vocabulary. Usage and perhaps the 
Spelling tests. Inproved language would be reflected also 
in higher Reading Coirprehension scores. Because the content 
used by all three treatments (LCE, LCS and NOA) included 
lessons with science and math concepts and processes, the 
experimental treatment groups would be ea^ected to score 
higher than the "control" groups on both arithmetic tests 
and on the Graphs and Map Reading subtests. Finally, if the 
e3q>erimental language curricula were effective in producing 
growth in English language, the LCE and LCS groups would be 
expected to score higher on all tests because their iinproved 
language skills would enable them first, to read the ques- 
tions better and second, to be more likely to know the 
answers because they had understood their teachers better. 

All ther expected differences did not occur. This 
does not mean that intensive language, or language activities 
in combination with AAAS lessons, are not effective. The 
findings of this study indicate only that the materials were 
not as effective as was hoped for in the San Antonio Language 
.Research Project during the years 1964 to 1968. No other 
conclusion can be fairly made. 

San Antoni o Curriculum Findings 
There is no obvious reason why the SAC group was 
superior at the third grade level but not at the fourth 
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grade level. The results from the two grade levels may be 
an une3q>lained contradiction. On the other hand, it is pos- 
sible that children learn a great deal of language by being 
exposed to it in instructional settings in traditional class- 
rooms. However, at some point, without intensive language 
training, their growth in language reaches a plateau. This 
may have happened to the SAC children in the fourth grade. 
This hypothesis is supported by Taylor's findings that, fifth 
grade "control" children scored lower than fourth grade 
"control" children in Fluency and Total Language on her test 
(Taylor's fifth grade and fourth grade "control" pupils came 
from the Scune population as the fourth and third grade pupils, 
respectively, -Jn this study). An alternative eaqjlanation is 
that the effects of intensive language instruction may be 
long run in nature and do not necessarily reveal themselves 
until after foxuc years of continuous instruction. 

Since no oral language assessment of the pupils was 
made during Year Four (1967-68), no conclusion can be reached 
as to whether the u:e and ICS treatments were not effective 
in in5)roving academic achievement even though language growth 
occurred, or whether the experimental language treatments 
produced differential achievement in neither language nor 
academic areas. 

NOA-LCE Treatment Findings 
The finding that the scores on the Iowa Tests of Basic 
S^"-j,ls of the LCE and NOA-LCE treatment groups were very 
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similar according to analysis of variance may be interpreted 
in at least three ways. (All three assume that inproved 
language abilities would be reflected in scores of written 
tests in various curriculum areas.) 

(1) The NOA treatment (which the third grade pupils 
had received for two years) was not very different from the 
LCE treatment. This may have been because the NOA teachers 
used the oral-aural techniques they learned about by talking 
to Ix:e teachers. If this is true, it implies that the ex- 
tensive inservice sessions the LCE teachers attended may not 
be any more effective for developing teaching techniques 
than informal teacher-to-teacher contacts. 

(2) The ICE treatment was not any more effective in 
developing oral language abilities than the informal language 
activities that were carried on by NOA teachers and pre- 
sumably by teachers of non- experimental classes. 

(3) The NOA treatment, was just as effective as the 
LCE treatment because of a specific characteristic of the 
approach to teaching science. The amount of teacher talk is 
generally much smaller than in the lectvire mode typically 
used in other subject areas. The pupils talk to the teacher 
more than usual in large group discussions and to each other 
in araall group discussions as they work on science project. 

Influence of iq 
There was a tendency at both grade levels for the chil- 
dren with higher IQ's to perform better than other treatment 
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group members on the Iowa Tests of Basic sicill« if they were 
in the SAC treatment group, while children with lower IQ's 
tended to perform better if they were in the experimental 
language treatments (lce and LCS) than if they were in the 
SAC treatment. This result was not anticipated by the San 
Antonio Language Research Project staff members. There is 
no way of knowing for sure which interpretation of the data 
would be most accurate. However, at least one hypothesis 
appears plausible at this time. 

If child performance is systematically handicapped 
on an IQ test because of limited language ability, then it 
vould appear that it is possible that children with little 
facility in English were helped by the experimental English 
or Spanish curricula. Children with little control of 
English, and therefore lower measured IQ's, who happened to 
be assigned to SAC treatment classes, were not given planned 
and prolonged instruction in language. Therefore, they were 
handicapped in their development of the various skills tapped 
^ ^'^^ Iowa Tests of Basic skills. However, if these children 
were in the ICE or LCS classes, they were given the kind of 
help in language they needed and thus were able to perform 
better on the Iowa T^sts of Basic skills . 

Children who had managed to learn a great deal of 
English as indicated by superior performance on the IQ tests 
and received the SAC treatment proceeded to develop their 
academic skills with no linguistic handicap. On fhe other 
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hand, children with well developed English language skills 
in the ICE and UZS treatment groups were actually held back 
in conqparison to their peers in the SAC treatments, because 
their teachers were spending time providing language lessons 
that these children did not need. 

This interpretation inplies a need for teachers to 
tailor their oral language instruction to the specific needs 
of their individual children. Approaches such as the one 
used in San Antonio in which language lessons are taught to 
entire classes with no grouping or individualization are 
inefficient and may even prove harmful to children whose 
needs are not being met by the large group lesson. 

The need for individualization of oral language in- 
struction means that there is an even greater necessity for 
two things: 

(1) Development and refinement of instruments for 
assessing oral language which teachers can use as diagnostic 
pretests. 

(2) Development of methods for training teachers 
how to assess oral language informally and how to individu- 
alize instruction in general in the self-contained classroom. 

Overall Results 
Interpretations of the favorable results for the SAC 
treatment at the third grade level and the LCE treatment at 
the fourth grade level should be made in view of the fact 
that the scores of the children in all the treatments, 
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esjperimental and control, at both grade levels were very low. 
Both McDowell in his study of first graders in Year One (1964- 
65) and Arnold in his study of first graders in Year Two 
(1965-66) found that the children in the Project schools 
performed very poorly on a variety of intelligence, reading 
and reading readiness tests in conparison to other children 
in the community who came from diverse socioeconomic classes 
and ethnic and language groups. There is a clear need for 
a fresh look at the problems of the children in the Project 
schools, and all children like them, and a necessity for 
intensive efforts to find ways to improve their instruction 
including, perhaps, a regrouping of the entire educational 
structure. 

The many difficulties these children face in school 
obviously are diverse and severe. The San Antonio Language 
Research Project tried to identify the problems and attert5)ted 
to alleviate many of them. That the attenpt was not success- 
ful is an indication of the difficulty and conplexity of the 
situation. 

Worth of the Project 
Since the esqserimental treatments did not produce 
great improvements in skills in reading and other academic 
areas, an appropriate question is whether or not the Project 
was worth the time, money and effort invested in it. The 
opinion of this writer is that the San Antonio Language 
Research Project did make contributions in at least the fol- 
lowing ways: 
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(1) Data which were gathered established dramatic- 
ally the inappropriateness for linguistically different 
students of a number of reading, reading readiness and in- 
telligence tests and the lack of adequate oral language 
assessmeht instruments. 

(2) By highlighting the problems to be faced in 
doing research in public schools, the San Antonio Language 
Research Project provided information to other groups about 
what they should strive to achieve, avoid and/or overcome. 

(3) The Project "ploughed ground" for language 
programs which followed and are continuing and which focus 
upon problems of linguistically different learners. 

RECOMMENDATIONS 

The findings of this study and the experience gained 
as a researcher in the San Antonio Language Research Project 
proitpt the proposal of these recommendations: 

(1) Any group doing longitudinal methodological 
research in public schools should take care to: 

(a) Obtain prior commitment of adminis- 
trators and principals, which must be 
communicated to the staff. 

(b) Assure th^tTuj^icient monitoring of 
teacher and pupil behavior be done 
so that data obtained in testing are 
'"*=5aningful and can be interpret at ed. 
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(c) Provide continuing staff development 
in the way of both consultant services 
and moral support. 

(d) Assure that critical teacher feedback 
to the materials development staff is 
transmitted and considered and acted 
upon. 

(e) Include as large a con^xjnent as possible 
which explains to all personnel involved 
the research and development process, 
why evaluation of both teacher and pupil 
performance and attitude is needed, and 
why instructional monitoring is necessary. 

(f ) Secure services of a Project director 
who is likely to stay for the length 
of the project. 

(2) Additional research with regard to oral lan- 
guage instruction and testing should be conducted in order 
to determine: 

(a) What kinds of oral language instruction 
are most effective in inproving oral 
language proficiency. 

(b) What amoxint of oral language instruction 
is appropriate. 

(c) If oral language instruction should be 
a separate component or a part of all 
areas of the curriculxan. 
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(d) How oral language instruction should 
be related to other areas of instruc- 
tion. 

(e) What aspects of oral language are most 
closely related to success in other 
school endeavors. 

(f ) Whether instruction in other areas 
should be modified to meet the needs 
of non-native English speakers and, if 
so , how. 

(g) If special materials need to be devel- 
oped for oral language instruction ^ or 
if emphasis should be on staff develop- 
ment to train teachers to construct 
their own language drills and activities 
which can be used with whatever materials 
are available in the school district. 

(3) Work should continue on the development and 
refinement of instruments and methods for assessing oral 
language facility. Progress in finding answers to the ques- 
tions listed in the second recommendation hinges on the 
availability of such instruments and methods. 



APPENDIX A 
ADDITIONAL SOCIOECONOMIC DATA 



Income 

Considering the poverty line to be $3,000 income per 
family per year, the following percentage of families are 
"poor" in the Southwest as of 1960: 

Anglo — 15.9 

Spanish- surname — 34. 8^ 
In Texas in 1960 over half of all Spanish- surname families 
had cinnual incomes lander $3,000.^ 

Housing 

Housing is considered overcrowded if there are 1.01 
or more persons per room, in 1960 the following percentage 
of housing was "crowded": 

Southwestern metropolitan areas: 
Anglo — 7.7 

Spanish-surname— 34. 6 

Texas: Anglo — 9.4 

- . , 3 
Spanish-surname — 46 . 3 

Iprank G. Mittlebach and Grace Marshall, "The Burden 
of Poverty," Mexican- American Study Project Advanc e Report 5, 
University of California, Los Angeles, 1^66, pp. 2-3. 

^Robert w. MacMillan, "A Study of the Effect of Socio- 
economic Factors on the School Achievement of Spanish- Speaking 
School Beginners," Unpublished Doctoral Dissertation, The 
University of Texas, Austin, 1966, p. 45. 

^Mittlebach and Marshall, "The Burden of Poverty," 

p. 44. 
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Occupation 

Sixty percent of all enployed urban Mexican-American 
males were in three "manual" classifications ^^.ccording to 
the Census Bureau: operatives (semiskilled), craftsmen and 
laborers. Only nineteen percent are in white collar occu- 
pations: professional, managerial, clerical and sales. 
Almost half of all enployed Anglos are white collar workers.'^ 

Unemployment 

The schools in the San Antonio Language Research 
Project are in an area which has the highest subemployment 
rate of ten areas studied by the U.S. Department of Labor, 
areas which included Harlem, East Harlem and Bedford- 
Stuyvesant.2 Forty-seven percent of the persons in the San 
Antonio slum areas who are or should be workers are sub- 
enployed.*^ * 



M«r-v«^o 2 ® Fogel, "Mexican-Americans in Southwest Labor 
Markets, Mexican- American Stu dy Project Advance Repo rt 10, 
University of California, Los Angeles, 1967, pp. 15I-I6. 

^^y.' ^The subemployment index includes people who are: 
fnlffL looking for work; working part time although 
full time work is wanted; heads of households under six who 
^^^^..^'rv, ^ ""^^^ VADrking full time; or not 

f Si tiL^ob! earning less than $56 a week on a 

nr-^c-i/i^r,?^*^: Department of Labor, Manpower Report of the 
ufilffSron ^J^eport on Manpower k^mTT^ ementr R^«nnr-~. 

Utilization, and Trainin g, Transmi tted to the Congress, 

April, 1967, united states Department of Labor, Washington, 
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